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FIFTH DECLARATION OF DR. GREGORY BLECK 



I, Dr. Gregory Bleck, state as follows: 

1 . My present position is Senior Director, Cell Line Engineering, Catalent Pharma 
Solutions. 

2. I am an inventor of the above referenced patent application. 

3. At page 7 of the Office Action, the Examiner recognizes that Mathor and Burns do not 
teach serial transduction to obtain cells comprising genomes with 20 to 100 integrated vectors. 
The Examiner then goes on to present arguments as why a person of skill in the art would use 
serial transduction to obtain cells comprising genomes with 20 to 1 00 integrated retroviral 
vectors. The Examiner supports this argument by citations to several publications. However, 
with one exception, the Examiner has made assumptions that are not factually supported by those 
references. The one exception is that the references do teach that there is a positive correlation 
between MOI and integration events. Many of the other assumptions made by the Examiner 
overstate what the references teach, especially in relation to the invention. 

4. At page 7, the Examiner states that Mathor et al. do teach that protein expression is 

directly proportional to integration events (i.e., copy number)(p. 10376, column 1). The 

Examiner goes on to state that: 

"It would have been obvious to one of skill in the art, at the time the invention was made, 
to modify the method of Mathor et al. and Burns et al. by serially transducing their cells 
with high MOls (such as MOls of 1,000) to achieve the claimed ranges of integration 



PATENT 

Attorney Docket No. GALA-08484 

events, with a reasonable expectation of success. The motivation to do so is provided by 
Mathor et aL, who teach the possibility of specifying the level of transgene expression by 
controlling the integration events (Abstract, p. 10376, column 1). 

The Examiner cites the abstract and p. 10376 of Mathor. Mathor et al. state in this section that 
"The rate of secretion of the exogenous protein by cultures generated by single clones was 
proportional to the number of integrations per progenitor cell." 

5. The Examiner has overstated the commentary and data in Mathor et al.. It is a fact that 
Mathor et al. does not contain any data or any statement that the level of transgene expression 
can be controlled by controlling integration events with the range of 20 to 100 integrations. 
Mathor et al. presents the data on pro viral integration and transgene expression on p. 10373 and 
in Table 1 . This data shows increasing transgene expression as the proviral integrations increase 
from 1 to 8. When the number of proviral integrations increases to 15 5 the transgene expression is 
actually decreases to a level lower than was observed with 8 integrations. Thus, as a factual 
matter, Mathor et al. teaches that transgene expression correlates with number of integrations 
over the range of 1 to 8 integrations. Transgene expression decreased when a cell line with 15 
integrations was analyzed. As a result, Mathor 5 s statement that "The rate of secretion of the 
exogenous protein by cultures generated by single clones was proportional to the number of 
integrations per progenitor cell" is valid with respect to the range of 1 to 8 integrations and does 
not apply outside of that range. The Examiner's attempt to apply the statement outside of the 
range is not factually supported, i.e., supported by the data. 

6. None of the other references relied on by the Examiner teach a correlation of transgene 
expression to integration number in the claimed range. Schott et al. teaches a correlation over the 
range of 1 to 9 integrations. See p. 304, Fig. 9. The increase between 4 and 5 integrations and 6 
and 7 integration is much greater than the increase between 7 and 9 integrations. This is similar 
to the Mathor et al. data and indicates that transgene expression levels off as opposed to 
continuing to increase, although, as explained in more detail below, predications outside of the 
data range cannot be validly made. 



PATENT 

Attorney Docket No. GALA-08484 

7. I have addressed the data in Mathor et al. in my previous Declaration. The Examiner 

addresses the previous Declaration at pages 13-14 of the Office Action. The Examiner states: 

The applicant argues that the fact that different clones can produce different amounts of 
protein has no relevance to whether a person of skill in the art would modify Mathor and 
make clones with 20 or more integrated retroviral vectors. This is not found persuasive. 
On the contrary, such knowledge in the art does have relevance to whether one of skill in 
the art would make clones with 20 or more integrated retroviral vectors. The argument 
that the data in Table 1 of Mathor et al., which is limited to a maximum of 
15 integrations cannot be extrapolated to a situation where there are 20 integrations is 
just an argument not supported by any evidence. Based on the teachings in the prior 
art (including Liu, Stamps, and Mathor et al.), one of skill in the art would have known 
that protein production is proportional to the number of integrated copies and that 
retroviral insertion is random and that expression level is dependent on the insertion 
sites; therefore, one of skill in the art would not conclude that the data in Table 1 
indicates a maximum of 15 integrations. Based on the teachings in the art as a whole, 
one of skill in the art would have had reasonably expected that clones comprising more 
than 1 5 integrations would express higher amounts of protein and would have known to 
look for several clones having higher integration numbers and select the high producer 
clones. 

The Examiner states that the argument that the data in Table 1 of Mathor et al., which is limited 
to a maximum of 15 integrations cannot be extrapolated to a situation where there are 20 
integrations is just an argument not supported by any evidence. This is not true. First, the data in 
Table 1 of Mathor et al. show that a clone with 15 integrations has a lower level of expression 
than a clone with 8 integrations. This is a fact. It is also a fact that the experiments in Mathor et 
al. were not conducted in a manner so that a statistical analysis could be conducted. The groups 
were not replicated and there is no way to determine experimental error. Thus, it is not possible 
to construct a curve or equation from the data so that a correlation of transgene expression to a 
number of integrations outside of the data range (i.e., 20 to 100 integrations) can be made. Any 
attempt to do so is speculation without a factual basis. For example, based on the data in Mathor, 
it is speculation as to whether another clone with 1 5 integrations would have a level of transgene 
expression that is higher or lower than the reported clone. The reason for this is that the data is 
not amenable to statistical analysis so that such a prediction can be made. 
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This is the reason why the Examiner is incorrect in arguing that "one of skill in the art would 
have had reasonably expected that clones comprising more than 15 integrations would express 
higher amounts of protein and would have known to look for several clones having higher 
integration numbers and select the high producer clones." The data in Mathor et al. is not 
amenable to this assumption. Because the experiments were not replicated and because there is 
not data on multiple clones with 15 (or a similar number) of integrations, it is not possible to 
predict or comment on the amount of expression one could expect from another clone with 15 
integrations. The single clone reported in Mathor et al. could be an example of the upper limit of 
expression, the median level of expression, or a low level of expression. The fact is, without 
additional data, one cannot know whether other clones with 15 integrations would have higher 
levels of expression than the observed clone. All that one knows for sure is the fact that the 
Mathor et al. data shows that expression from the clone with 15 integrations was lower than the 
expression observed in the clone with 8 integrations. 

The Examiner further states that based on the teachings in the prior art (including Liu, Stamps, 
and Mathor et al.), one of skill in the art would have known that protein production is 
proportional to the number of integrated copies and that retroviral insertion is random and that 
expression level is dependent on the insertion sites; therefore, one of skill in the art would not 
conclude that the data in Table 1 indicates a maximum of 15 integrations. 

As explained above, the data in Mathor et al. indicates that transgene expression increases with 
increases number of integrations up to 8 integrations. Transgene expression decreased when the 
number of integrations increased to 15. No other conclusions can be made based on this data. 
Furthermore, Lui et al. contains data on the correlation of expression of transgenes separated by 
an IRES and does not address transgene expression correlated to number of integrations. See 
Abstract, Fig. 2, Fig. 4, Fig. 5. Stamps et al. examined the role of the T-antigen gene and its site 
of integration in human epithelial cell immortalization, p. 871, Col. 2, first full para. The cells 
examined had up to five integrations. See Fig. 2 and Fig. 3. Stamps et al. does not comment on 
a correlation of transgene expression to number of integrations. These references do not provide 
factual support for the Examiner's argument. 
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8. In my previous Declaration, I provided evidence showing that at the time of the 

invention, the state of the art was that methylation of integrated retroviral vectors posed serious 
limitations on the use of the vectors for expression of transgenes. In response, the Examiner 
states: 



The applicant argues that many of the references cited by Bestor and those 
included in Paragraph 4 of the fourth Declaration describe silencing in vitro due to 
methylation. This is not found persuasive for the same reasons as above. Specifically, 
the prior art teaches that methylation is dependent on the integration site, i.e., 
consistent with the teachings of Liu, Stamps, and Mathor et al. that expression level is 
dependent on the insertion sites. Gunzburg et al. (The EMBO Journal, 1984, 3: 1 129- 
1135) teach that retroviral integration is random and take place either in active (i.e., the 
virus is expressed) or in inactive (i.e., the virus is not expressed) chromatin domains 
(see p. 1 129, paragraph bridging columns 1 and 2, p. 1 133, column 2, p. 1 134, column 
1). Based on these teachings, one of skill in the art would have known that the same 
number of integrations would result in different expression levels, depending on the 
insertion site. Furthermore, the prior art teaches that the expression and stability of the 
gene of interest directly correlates with the number of integrated retroviral vectors (see 
Schott et al. above). One of skill in the art would have known to look for clones 
comprising high numbers of integrated retroviral vectors and select the ones capable of 
producing high amounts of protein. 

Gunzburg et al., which was published in 1984, addresses methylation of "multiple endogenous 
mouse mammary tumour virus (MMTV) proviral genes" that "are present at different locations in 
mouse inbred strains." See Abstract. Gunzburg et al. finds that the methylation patterns are 
location and tissue specific and that the patterns are stably inherited and appear to be conferred 
upon the viral DNA by the flanking mouse genomic DNA. See Abstract. The authors state that 
"upon integration the provirus assumes the methylation pattern of the DNA into which is 
integrates." p. 1 129, col. 1., p. 133, col. 2. Importantly, Gunzburg et al. does not contain data or 
comments that address any correlation of methylation to expression of genes. Just as important, 
Gunzburg et al. addresses endogenous proviral sequences and not the introduction of exogenous 
vectors containing transgenes. These proviral sequences are endogenous to the genome and have 
been acquired at some point in the distant past. Gunzburg et al. has very little relevance to the 
present invention or to the evidence I previously submitted. 

Scientific papers that are more contemporary to this invention do contain relevant evidence. 



PATENT 

Attorney Docket No. GALA-08484 



Bestor and Tycko 1996 (attached at Tab 1), identify two hypothetical roles of genomic 
methylation patterns. The first is a role of programmed demethylation and methylation during 
development, p. 363 , col. 1. The second role is that cytosine methylation is part of a genome 
defense system which inactivates parasitic sequences such as transposable elements and proviral 
DNA (i.e., integrated retroviruses), p. 363, col. 1. 

This second role of methylation is directly relevant to the present invention which utilizes high 
levels of integrated retroviral vectors. Bestor and Tycko explain this relevance: 

The host defense hypothesis requires that the silencing apparatus recognize and inactivate 
parasitic sequence elements. Nearly all transposition and viral integration intermediates 
share certain structural features, and some satellite DNA is thought to undergo 
amplification by extrachromosomal rolling circle replication followed by insertion of the 
array into the genome. Recognition and de novo methylation of CpG sites in and around 
features characteristic of integration reactions would insure the inactivation of the 
invasive element immediately upon its integration. DNA methyltransferase may have an 
intrinsic ability to recognize integration intermediates that are characteristic of the above 
integration events. The de novo sequence specificity of mammalian DNA 
methyltransferase is strongly dependent on alternative secondary structures in DNA; four- 
way junctions in cruciform structures formed by inverted-repeats in supercoiled-plasmids 
are especially favored targets, as are secondary structures in artificial oligonucleotide 
substrates. This biochemical property suggests that invasive sequences might be targeted 
for de novo methylation because of their presentation of alternative secondary structures 
during integration (Fig la). 

(p. 364, col. 2, Citations omitted). 

Thus, it was a concern that due to the nature of retroviral integration, the retroviral vectors would 
be targeted for inactivation by methylation. Increasing copy number enhances this problem. "A 
common characteristic of invasive sequences is their presence in multiple copies, and it has 
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recently become known that repeated sequences can interact so as to trigger their mutual 
silencing." p. 364, col. 2. Bestor and Tycko further address retroviral vectors: 

Retroviral vectors that transducer reporter genes or therapeutic agents have been observed 
to undergo methylation silencing after variable periods of expression in animals. 
Susceptability to de novo methylation and silencing has limited the usefulness of 
retroviral vectors in the construction of transgenic mouse lines, and it is probable that 
silencing phenomena may emerge as barriers to long term somatic gene therapy in 
humans. Successful gene transfer may require development of delivery vectors that evade 
the silencing response. 

p. 365, col. 2. 

Bestor and Tycko 1996 demonstrates why Gunzburg et al. is not relevant to the invention. 
Gunzburg et al. does not address the host defense mechanism at all or that fact that vectors had 
been shown to be actively silenced by methylation. 

Garrick et al. 1998 (Tab 2) provide evidence on repeat-induced gene silencing in mammals. 
They used a lox/cre system to analyze the effect of copy number on transgene expression. They 
found that "reduction in copy number results in a marked increase in expression of the transgene 
and is accompanied by decreased chromatin compaction and decreased methylation at the 
transgene locus." p. 56, col. L, p. 58. Again, this paper provides evidence that the state of the art 
was that increasing copy number leads to methylation and inactivation of transgenes. The vector 
construct was not a retroviral vector, but this data is directly relevant to inactivation of an 
introduced transgene. As Bestor and Tycko 1996 indicated, the host defense mechanism is 
triggered by multiple copies of invasive sequences. The transposon system used in this paper and 
retroviral vectors are both invasive sequences. 

Cherry et al. 2000 (Tab 3) is co-authored by two of the leading scientists in the field, Dr. David 
Baltimore and Dr. Rudy Jaenisch. They also recognize the role of methylation in the inactivation 
of proviral genes. They state: "DNA methylation is thought to be a general mechanism 
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used by cells to silence foreign DNA and may be involved in the cell defense against 
transposable elements (39). DNA methylation has also been associated with the repression of 
gene expression and the silencing of viral control elements (2, 14, 38). Exogenously introduced 
retroviruses silenced in vitro and in vivo can be reactivated by treatments that result in 
genome wide demethylation. In addition, transcriptionally silent endogenous retroviral elements 
are reactivated upon loss of genomic methylation in Dnmtl knockout mice (38). Therefore, 
DNA methylation is thought to causally repress expression of retroviral promoters in a variety of 
cell types." p. 7419, col. 1-2. Both methylation-dependent and methylation-independent 
mechanisms exist to control retroviral gene expression." p. 7425, col. L 

Other papers also provide evidence regarding methylation and inactivation of multiple introduced 
copies of exogenous genes and inaction of retroviral vectors. Mehtali et al. 1990 (Tab 4) 
conducted experiments that show that methylation of an introduced transgene increases with 
increasing copy number and that expression of the transgene decreases with increasing copy 
number after initially increasing. See Table 1 , p. 1 82. The vector construct was not a retroviral 
vector, but this data is directly relevant to inactivation of an introduced transgene. 

Niwa et al. 1983 (Tab 5) postulated that that there are two independent mechanisms that block 
expression from newly acquired retroviral vectors. See Abstract, p. 1 105. The first mechanism 
operates in undifferentiated cells to block expression of M-MuLV and other exogenously 
acquired viral genes, such as SV40 and polyoma virus, and does not depend on DNA 
methylation. The second mechanism relates only to differentiated cells and represses expression 
of genes in which DNA is methylated. This paper further serves to demonstrate why the 
Examiner's reliance on Gunzburg et al. is inappropriate. Newly acquired retroviral vectors are 
treated by cells in a different manner from proviral sequences that have been integrated into the 
genome in the distant past and essentially become endogenous. 

Svoboda et al. 2000 (Tab 6) examines the expression of retroviral vectors in foreign species. The 
vectors are subject to cell-mediated control at the transcriptional and posttranscriptional levels. 
Abstract, p. 1 8 1 . Of main importance is cell transcriptional regulation, which can lead to 
proviral silencing, p. 181, col. 2. The authors note that all of the data so far point to the 
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important role of methylation in provirus silencing in general and that strategies for preventing 
methylation should contribute to more efficient gene transfer in the future, p. 186, col. 2. Again, 
the state of the art was that newly acquired retroviral vectors are subject to silencing by 
methylation. This is in direct contrast to the Examiner's conclusions based on Gunzburg et al. 

Ellis and Pannell 2001 (Tab 7) also examine retrovirus silencing. They state that inclusion of 
appropriate regulatory elements may not be sufficient because the vectors are frequently silenced 
and that a better understanding of the mechanism of vector silencing is needed, p. 17, col. 1-2. 

Challita and Kohn 1994 (Tab 8) provide data that shows that lack of expression following 
retroviral transduction is due to methylation. As stated by the authors: "Methylation of cytosine 
residues has been shown to be associated with suppression of gene expression and, in certain 
circumstances, with the silencing of viral control elements (6). The MoMuLV-LTR is completely 
inactive in embryonic stem and embryonic carcinoma cell lines, and the inactivity is 
accompanied by de novo methylation of the proviral sequences (7, 8). Moreover, methylation has 
been detected in association with the MoMuLV-LTR transcriptional inactivity in fibroblasts in 
vitro (9) and in vivo (2)." As shown by Ellis and Pannell (Tab 6), these problems still had not 
been solved by 2001, even when regulatory elements other than the retroviral LTR are used. 

9. The references cited above establish that at the time of our invention the state of the art 
was that: 1) cells have a host defense mechanism that inactivates newly introduced, invading 
sequences such as retroviral vectors; 2) the host defense mechanism operates by methylation of 
the invading sequences, which causes transcriptional inactivation of the sequences; 3) 
transcriptional inactivation by methylation leads to reduced expression from retroviral vectors; 4) 
the inactivation may be triggered by structures formed during integration of the retroviral vectors; 
and 5) the presence of multiple repeats of an invading sequence such as a retroviral vector 
triggers methylation and inactivation. 

10. The citations for the references cited above follow. These references are provided in the 
Appendix attached to this Declaration. 
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11. I further declare that all statement made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under section 1001 of title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
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There are two biological properties of 
genomic methylation patterns that can be 
regarded as established. First, methylation of 
5'-CpG-3" dinucleotides within promoters 
represses transcription, often to undetectable 
levels. Second, in most cases methylation 
patterns are subject to clonal inheritance. 
These properties suit methylation patterns for 
a number of biological roles, although none of 
the current hypotheses can be regarded as 
proved or disproved. One hypothesis 
suggests that the activity of parasitic 
sequence elements is repressed by selective 
methylation. Features of invasive sequences 
that might allow their identification and 
inactivation are discussed in terms of the 
genome defense hypothesis. Identification of 
the cues that direct de novo methylation may 
reveal the biological role (or roles) of genomic 
methylation patterns. 



; Hie mammalian genome is ornamented with ~3 x 10 7 
methyi groups, all at the 5 position of cytosine (m 5 C) and 
mo .sLat_ S^CpG-3 1 dinudeotides. Methylation patterns 
increase the information content of the genome 1 and are. 
transmitted by clonal inheritance 2 ; methylation of CpG 
sites within promoters represses transcription 3 . This nat- 
ural modification is also dangerous: rrrC is the major 
endogenous mutagen (examination results in C— >T tran- 
sition mutations at CpG sites, which account for about 
on e-third of al l mutations in hnrrians) 4 , and tumour sup- 
pressor genes are frequently inactivated by ectopic de novo 
methylation of promoter regions 5 * 6 . However, there must 
be benefits that yield a net selective advantage. This is 
shown by the retention of cytosine methylation by virtu- 
ally all organisms with genomes >5 x 10* basepairs 7 , and 
by the demonstration that perturbations of methylation 
patterns are lethal to mouse embryos and to differentiated 
cells 8,9 . While methylation patterns dearly provide some 
essential function, the nature of that function or functions 
is still enigmatic 

There are at present two salient hypothetical roles of 
genomic methylation patterns. The first hypothesis (put 



forward twenty years ago) suggests that programmed 
demethylation and de novo methylation play a direct role 
in gene control during development 10,11 . Methylation 
patterns would be established during gametogenesis or 
early development, and regulatory factors would mediate 
the removal of methyl groups from promoters to allow 
the expression of tissue-specific genes at the appropriate 
stage of differentiation. While supported by a large body 
of indirect and correlative evidence, a defimtive test of the 
causality of cytosine methyiatioh in developmental gene 
control has been elusive. By the same token, a 
developmental role for cytosine methylation has not yet 
been disproved. It seems likely that methylation patterns 
might reinforce the heritability of states of gene ex- 
pression mediated by chromatin proteins analogous to 
the Polycomb and trithorax group of proteins from 
Drosophila 12 ' 13 . 

The second hypothesis suggests that cytosine methyla- 
tion is part of a genome defense system which inactivates 
parasitic sequences such as transposable elements and 
proviral DNA 4 * 7 . Most of these elements are in fact 
methylated and transcriptionally inert in the genome of 
mammals, flowering plants, and those fungi whose 
genomes contain m 5 C Treatment of cultured cells or 
mice with the demethylating drug 5-azacytidine can acti- 
vate silent retroviruses and endogenous genes that have 
been silenced by ectopic de novo methylation of regulato- 
ry regions 14 . It is striking that Drosophila, whose DNA 
lacks m 5 Q suffers far larger numbers of insertion mu- 
tations than do animals whose genomes are methylated 15 . 
These observations, with support from evolutionary 
considerations 7 , strongly suggest that cytosine methyla- 
tion is part of a genomic host defense system that sup- 
presses the transcription of parasitic sequence elements. 
The selective advantage of such a defensive system is 
obvious, given that a sizable fraction of the genome rep- 
resents parasitic sequences that are invisible to the 
immune system and which might inflict intolerable 
mutational or cytotoxic damage if allowed, to proliferate 
unchecked. 

Inese hypothetical roles of genomic methylation in 
development or host defense place fundamentally differ- 
ent requirements on the DNA methylating system. The 
developmental role requires that methylation patterns be 
established as part of the developmental program via 
conventional sequence recognition mediated by sequence- 
specific DNA methyltransferases of specificity factors that 
interact with the ubiquitous DNA methyltransf erase. The 
host-defense function requires a completely different 
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Rg. 1 Structure-dependent de novo methylation. Complementary 
sequences are depicted in identical colors, and structural features 
that are proposed to trigger silencing are boxed. M: 5-methylcyto- 
sine.'a, A retrovirus integration intermediate showing the stage at 
which free 3' ends of the proviral DNA have been linked to 5' ends of 
chromosomal DNA. Most transposition and retroviral integration 
events utilize a common mechanism 19 , and integrases and trans- 
posases share sequence or structural similarities. Recognition of 
integration/transposition .intermediates by the silencing system 
could protect the genome from the proliferation of parasitic 
sequences, b, formation of three-way junctions in replication slip- 
page structures at (GGC)/(GCC) repeats. Nearly all triplet repeat 
expansions at (GGC)/(GCC) sequences trigger cfe now methylation 
in and around the repeats, and it is suggested that the three-way 
junction is the stimulus. Silencing of the FMR1 gene in Fragile X syn- 
drome is suggested to occur via this pathway, c, Detection of inva- 
sive sequences via homology/heterology boundaries during strand 
exchange 32 . Recognition and methylation of the boxed regions 
would result in reciprocal silencing of repeated sequences that 
reside in different sequence contexts. 



source of specificity; de novo methylation must be cued by 
dynamic structural features unique to parasitic sequences 
in the act of invading the genome, or by events such as 
transient pairing interactions of repetitive elements. In 
* other words, the developmental role requires that methyla- 
tion be directed to specific sequences as part of the devel- 
opmental programme, while the host defense role requires 
that methylation be contingent on the interaction of fea- 
tures characteristic of parasitic sequence elements with the 
DNA memylatmg system. Therefore it should be possible 
to deduce the role of methylation patterns from the nature 
of the biochemical events by which they were established 

Sequences that attract or repel de novo methylation in 
transacted embryonic cells have been identified 16 "" 18 , but 
many sequences (both foreign and endogenous) can be 
stably propagated in either the methylated or unmethylat- 
ed state, and de novo methylation of such elements must 
therefore be dependent on features other than simple 
sequence recognition. The remainder of this article dis- 
cusses possible mechanisms by which such sequences 
might be recognized and silenced 

3R4 



Invasive DNA/n flagrante 

The host defense hypothesis requires that the silencing ap- 
paratus recognize and inactivate parasitic sequence ele- 
ments. Nearly all transposition and viral integration 
intermediates share certain structural features 19 , and 
some satellite DNA is thought to undergo ampHflcation 
by extrachromosomal rolling circle replication followed 
by insertion of the array into the genome 20 . Recognition \ 
and de novo methylation of CpG sites in and around fea- 
tures characteristic of integration reactions would insure 
-theinactivatioriof the invasive element irnmediatelyjupon 
its integration. DNA methyltransferase may have an 
mtrinsic ability to recognize integration intermediates 
that are characteristic of the above integration events. The 
de novo sequence specificity of mammalian DNA methyl- 
transferase is strongly dependent on alternative secondary 
structures in DNA; four- way junctions in cruciform struc- 
tures formed by inverted repeats- in- sup ercoiled-plasmids 
are especially favored targets 21 , as are secondary structures 
in artificial oligonucleotide substrates 22 . This biocfiemical 
property suggests that invasive sequences mighi be target- 
ed for de novo methyjatipn because of their presentation 
of alternative secondary structures during integration (Fig 
la). 

Three-way junctions share structural features with 
four-way junctions, and enzymes that recognize four-way 
junctions (such as T4 endonudease VII (re£ 23) and 17 
endonuclease I (ref.. 24)) recognize three-way junctions as 
welL The nature of the interaction of the methylating sys- 
tem with three-way junctions is of Special interest because 
it may be involved in the aetiology of human diseases 
associated with GGC/GCC triplet repeat expansions. Slip- 
page of replication intermediates can result in extrusion of 
a segment of GGC/GCC repeats, which form stable three: 
way junctions despite a lack of perfect complementarity in 
the extruded segment 22,25 (Fig. lfy. The preference of 
DNA methyltransferase for junctions may result in de 
novo methylation of sequences in slippage structures. Dis- 
eases that result from GGC/GCC triplet repeat expansions 
may be thought of as autoimmune disorders, in which a 
host defense system attacks an innocuous alteration of an 
endogenous gene because of its incidental similarity to a 
parasitic sequence element in the act of integration. 

Cytostne methylation and pairing interactions 

A common characteristic of invasive sequences is their 
presence in multiple copies, and it has recently become 
known that repeated sequences can interact so as to trig- 
ger their mutual silencing. Fungi and flowering plants 
have diverse and highly effective means of silencing 
repeated sequences, and cytosine methylation is associated 
with silencing in nearly all" cases. REP (repeat-induced 
point mutation) imposes methylation, silencing, and large 
numbers of C— *T transition mutations on repeated 
sequences during the sexual phase of the fungus Neu- 
rospom crassa 16 , while MIP (methylation induced 
premeioticaUy) inactivates arid methyhtes repeats during 
the sexual cycle of the fungus Ascoholus imniersus 27 . Flow- 
ering plants can also silence and methylate repeated 
sequences via RIGS (repeat-induced gene sUencing) 28, 29 ; 
the efficiency is such that multicopy transgenes frequently 
cause mutual silencing of themselves and of homologous 
resident sequences. The process can be remarkably effi- 
cient; unlinked transgene sequences as small as 300 bp can 
identify and inactivate each other in a genome of >10 9 bp. 
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The silenced state can persist even after the repeated se- 
quences have been separated by segregation in sexual 
crosses 28 . Transgene silencing of this type is emerging as a 
barrier to the improvement of commercially-important 
plant species 30,31 . It is difficult to imagine a selective 
advantage for RIP, MIP, or RIGS other than host defense " 
against the proliferation of parasitic sequence elements. 

Transient pairing interactions are. likely to be involved 
in the imposition, of silencing and methylation on repeats; 
two mechanisms can .be envisioned. First, strand ex- 
change between repeats at non-allelic positions will pre- 
sent an abrupt loss of homology at the junction with 
flanking sequences (Fig. lc), and silencing may be pro- 
voked by factors that recognize characteristic structural 
features at the homology/heterology boundary. This idea 
wks developed by Signer and colleagues 32 to explain copy 
nuniber-dependent transgene silencing in Arabidopsis. 
Second, methylation might propagate through a network 
of repeated sequences once one or more copies are methy- 
lated (Fig. 2). Strand exchange between methylated and 
unmethylated repeated sequences creates hemimethylated 
intejfmediates, which as mentioned previously, are strong- 
ly preferred substrates of DNA methyltransferase 1 2 ; the 
preference for henoimethylated substrates normally con- 
tributes to maintenance methylation, but under these 
conditions de novo methylation is the result Strand ex- 
change reactions (which are thought to occur frequently 
as part of the double strand break repair pathway 33 ) (Fig. 
2) could cause a common methylation pattern to propa- 
gate through a network of repeats 28,30 , and the probability 
of pairing with a methylated repeat will increase in pro- 
portion to the number of methylated repeats. A large 
number of methylated resident repeats will therefore 
increase_the, probability that a new, unmethylated copy 
will be methylated soon after its insertion into the 
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Fig. 3 Vulnerability of heterodupiex recombination intermediates to 
de novo methylation. Crossing-over during meiosis presents DNA 
methyltransferase with highly preferred targets: four-way junctions, 
mismatches and hemimethyiated sites. Deregulated de novo 
methylation appears to be prevented by sharp down-regulation of 
DNA methyltransferase during the pachytene stage of meiosis; the 
mechanism involves the production of a larger, non-translated DNA 
methyltransferase mRNA 4 **. 



Fig. 2 Unidirectional transfer of epigenetic in- 
formation via paired hemimethylated intermedi- 
ates. Strand exchange between methylated and 
unmethylated repeated sequences presents 
hemimethylated sites, which are the preferred 
substrate of DNA methyltransferase; note that the 
maintenance activity "of DNA methyltransferase 
leads to de novo methylation under these condi- 
tions. A common methylation pattern could prop- 
agate through an array or network of repeats by 
this mechanism, and methylation patterns could 
be transferred between alleles at loci that bear 
allele-specrfic methylation patterns. 



genome. A large burden of parasitic sequences may have 
the paradoxical effect of repressing their own activity and 
that of homologous invasive sequences. Natural selection 
may actually favor the retention of large numbers of inac- 
tive parasitic sequences for this reason. 

Many endogenous genes are present in multiple copies 
which escape repeat-dependent gene silencing. In Neu- 
rospora, which seems to be especially aggressive in its 
response towards repeated sequences, the efficiency of 
RIP is greater when the repeats are in close proximity 26 . 
This argues for special protective mechanisms that shield 
repeated cellular genes from repeat-induced silencing. A 
complete intolerance of repeated sequences would also 
put severe constraints on the evolution of new functions, 
which depends on the duplication and divergence of 
existing genes. However, the factors that control sensitivi- 
ty to repeat-dependent silencing are poorly understood 34 . 

There is no dear experimental evidence that repeat- 
dependent silencing or methylation operates in mam- 
mals, as is the case in fungi and plants. However, no 
objective search for repeat-dependent silencing in mam- 
mals seems to have been conducted. Circumstantial evi- 
dence suggests that it may occur 35 . Much of the m 5 C in 
the mammalian genome is found in repetitive DNA, and 
most of the repeated and potentially transposable el- 
ements in the genome are methylated and tran- 
scriptionally inert through most of development 36 ' 37 . 
Retroviral vectors that transduce reporter genes or thera- 
peutic agents have been observed to undergo methylation 
silencing after variable periods of expression in animals 38 . 
Susceptibility to de novo methylation and silencing has 
limited the usefulness of retroviral vectors in the con- 
struction of transgenic mouse lines, and it is probable that 
silencing phenomena may emerge as barriers to' long- 
term somatic gene therapy in humans. Successful gene 
transfer may require the development of delivery vectors 
that evade the silencing response. It is also possible that 
the development of therapeutic agents that perform selec- 
tive methylation of deleterious sequence elements (such 
as HIV-1 proviral DNA) will activate an existing host 
defense- system and therefore - serve as- a sortrof nuclear 
vaccine. 

Maintenance of allele-specrfic methylation patterns 

Mammalian genes whose expression is dependent on the 
sex of the contributing parent are said to be imprinted; 
such genes bear allele-specinc methylation patterns that 
are necessary for maintenance of the imprinted state 9 . 
Short repetitive sequences are characteristic of imprinted 
genes 39,40 ; these short repeats may form alternative sec- 
ondary structures that trigger de novo methylation in the 
germ line or during early postfertilization development-. A 
parent-of-origin effect on gene expression results if there 
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are differences in the types of DNA or chromatin struc- eversion of the target cytosine during the transmethyla- 

tures formed, or if the response to the structures differs, in tion reaction 46 ' 47 . Presentation of these vulnerable sites 

the male and female germlines. In this way a nuclear host might create a "methylation ratchet" in which methyla- 

defense system may give rise to allele-specific methylation tion levels increase in an unregulated fashion with each 

. patterns and to the genetic phenomenon of imprinting 41 , meiotic cycle. 

and could also account for documented gamete-of-origin Examination of purified germ cells from male mice 

effects on methylation of repetitive DNA 37 . It should be showed that both DNA methyltransferase protein and the 

noted that several mutations that result from insertion of 5.2-kb DNA methyltransferase mRNA found in all prolif- \ 

retroviruses of the intracisternal A particle (IAP) type erating cell types was present in all germ cell fractions, 

confer imprinted behavior on genes that are not imprint- except for pachytene spermatocytes. These spermatocytes 

edm the wud type 42 ' 43 , ■.showed anabsence QtDHAjneJ^yltiartfferase prQtejriand 

Alleles at imprinted loci are asymmetrically methylated, contained a 6.2-kb RNA that was not associated with 
and the mechanism shown in Fig. 2 will tend to convert polyribosomes. It is the pachytene stage of meibsis I where 
asymmetrical allelic methylation patterns towards the most crossing-over occurs. The ubiquitous 5.2-kb DNA 
more heavily methylated pattern. Certain pathological methyltransferase mRNA and DNA methyltransferase 
human conditions show abnormalities in the functional protein reappeared at the conclusion of the crossing-over 
imprinting of particular chromosomal regions which phase of meiosis 48 . These findings suggest that meiotic 
might arise via this type of interchro mo somal transfer of recombination intermediates -are _ protected from Ae^navo 
epigenetic information. For example, Wilms' tumours fire- methylation through down-regulation of DNA methyl^ 
quently show conversion to a uniparental (bipaternal) transferase via a novel post-transcriptional mechanism 
methylation and expression pattern at imprinted loci in that involves the production of a larger, non-translated 
the H19/IGF2 region on chromosome llpl5.5, which RNA transcript. 
could result from the local transfer of methylation pat- 
terns from the paternal chromosome to the less heavily- The function of cytosine methylation 
methylated maternal chromosome 44 . It is important to The tact that cytosine methylation can increase the infor- 
note, however, that a conversion to a symmetrical uni- mation content of DNA has tempted many to attribute 
parental allelic methylation pattern at imprinted loci diverse roles to methylation patterns. Cytosine methyia- 
could also come about by an alternative mechanism that tion has been proposed to reduce the effective size of the 
involves deletion or mutations of a as-acting DNA ele- genome by masking non-regulatory regions in large- 
ment (an 'miprinting centre'). This has been implicated in genome eukaryotes 7,49 , and central roles in DNA 
certain kindreds that show disruption of imprinting pat- repair 22 ^ 50 and replication 51 have also been mooted. None 
terns within the chromosome 15 Prader Willi/Angelman of the hypothetical functions of cytosine methylation (and 
syndrome region 45 . this includes the developmental and host-defense func- 

. tions) has the support of compelling experimental evi- 

Vulnerabilrty of meiotic recombination intermediates dence, and all, some, or none of the hypotheses may be 

Allelic heteroduplex recombination intermediates present valid. At this time it seems that the true function of cy- 

several features that should make them very vulnerable to tosine methylation will be understood only when we learn 

denovo methylation (Fig. 3). First, allelic methylation dif- how the cell selects specific sequences for covalent modifi- 

ferences create heniimethylated sites in the heteroduplex- cation, 
es, which provoke de novo methylation via the 

maintenance activity of DNA methyltransferase (see Fig. Aknowiedgements 

2). Second, Holliday structures and four-way junctions E SeJkerandM. Turkerrnadehelpfidcomrnents on ^manuscript 

are necessarily present; as described earlier, these struc- Supported by NIH grants GM00616 and CA60610 (to T. KB.) and 

tures appear to be favored targets of DNA methyl- NIH grant CA60765 and American Cancer Society grant JFRA482 (to 

transferase. Third, mismatches in the vicinity of CpG B.T). 

dinudeotides greatly favor de novo methylation, presum- ■ 

ably by lowering the energetic barrier associated with Received 27 November 1995; accepted 3 January 1996. 
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Repeat-induced gene silencing In mammals 

David Garrick 1 , Steven Fiering 2 ' 3 , David IX Martin 2 & Emma Whitelaw 1 



In both plants 1 " 3 and Drosophila melanogaster* 5 , expression 
from a transgenic locus may be silenced when repeated trans- 
gene copies are arranged as a concatameric array. This repeat- 
induced gene silencing is frequently manifested as a decrease 
in the proportion of cells. that express the transgene, resulting 
in a variegated pattern of expression. There is also some indi- 
cation that, in transgenic mammals, the number of transgene 
copies within an array can exert a repressive influence on 
expression, with several mouse studies reporting a decrease in 
the level of expression per copy as copy number increases 6-8 . 
However, because these studies compare different sites of 
transgene integration as well as arrays with different numbers 
of copies, the expression levels observed may be subject to 
varying position effects as well as the influence of the multi- 
copy array. Here we describe use of the lox/Cre system of site- 
specific recombination to generate transgenic mouse lines in 
which different numbers of a transgene are present at the 
same chromosomal location, thereby eliminating the contribu- 
tion of position effects and allowing analysis of the effect of 
copy number alone on transgene silencing. Reduction in copy 
number results in a marked increase in expression of the trans- 
gene and is accompanied by decreased chromatin compaction 
and decreased methylation at the transgene locus. These find- 
ings establish that the presence of multiple homologous copies 
of a transgene within a concatameric array can have a repres- 
sive effect upon gene expression in mammalian systems. 

The aPElox construct used to generate transgenic mouse 
lines is shown in Fig. 1 a. In this transgene, expression of the lacZ 
reporter gene is driven by the human a-globin promoter and 
the aHS-40 enhancer-like element of the a-like globin locus 9 . 
As expression of the lacZ reporter can be analysed in single ery- 
throid cells, use of this transgene allows the detection of varie- 
gated patterns of expression 10 . The transgene also contains a 
single copy of the 34-bp recognition site (loxP) of the Cre 
recombinase of bacteriophage PI (ref. 11). Cre-mediated 
recombination between loxP sites in individual aPElox trans- 
genes that form part of a concat americ array will red uce the 
transgene copy number without altering the site of integration 12 
(Fig. 1 by. Although transgenes within a multicopy array in mice 
are usually present in a tandem (head-tail) orientation, inverted 
repeats (head-head or tail-tail) do occur 13 . Because Cre recom- 
bination between inverted loxP sites causes the inversion rather 
than excision of the intervening DNA, only recombination 
between sites in like-oriented transgene monomers will reduce 
the copy number of the array. 

We generated founder transgenic mice bearing the aPElox con- 
struct, and from them we established hernizygous transgenic lines. 
Southern blotting of tail DNA. revealed that two of these trans- 
genic lines (aPEloxl and ctPElox7) contain more than 100 copies 
of the transgene (Fig. 2), and these lines were selected for Cre- 
mediated reduction in transgene copy number. For each of these 
parent lines, we collected fertilized oocytes from wild-type female 
mice mated with G l hernizygous transgenic males. Oocytes were 



micro-injected with a circular Cre expression vector (pCAGGS- 
Cre) and then transferred into foster mothers. Transient expres- 
sion of the recombinase from the unincorporated pCAGGS-Cre 
plasmid has been shown to catalyze efficient site- specific recombi- 
nation at loxP sites within the mouse genome before the morular 
stage of development 14 . Live-born progeny that showed a reduced 
transgene copy number as determined by initial Southern blotting 
were mated with wild-type animals to establish transgenic lines 
hernizygous for the modified locus. Cre recombination within the 
aPEloxl parent line (more than 100 copies) gave rise to two dis- 
tinct reduced-copy progeny lines: l.cre/a, which contains five 
copies of the transgene, and l.cre/b, in which the array has been 
reduced to a single copy (Fig. 2). Southern-blot analysis indicated 
that the single copy remaining in the l.cre/b line contains a 
rearrangement/deletion (data not shown). The aPElox7 parent 
line (more than 100 copies) gave rise to a single derivative line 
(7.cre7a) bearing one copy of the transgene. 

For parental and reduced-copy progeny lines, transgene 
expression was analysed in 12.5-dpc embryos by staining of 
whole primitive erythrocytes with X-gal. We previously showed 
that all cells containing pi-galactosidase activity can be detected 
by light microscopy after staining under these conditions 10,15 . 
For both of the high-copy parent lines, a heavily variegated pat- 
tern of transgene expression was observed, with less than 1% of 



O-glohin 





Fig. 1 Strategy for Cre-mediated reduction of transgene copy number, a, The 
aPElox transgene construct contains nucleotides -573 to +36 of the human 
o-globin promoter upstream of the SDK-ZdeZpA cassette used previously 30 and 
a 4.2-kb fragment containing the aHS-40 DNasel hypersensitive site (vertical 
arrow). An oligonucleotide containing a single copy of the 34-bp /oxPsrte was 
inserted between the SDK-tecZpA cassette and the aHS-40 fragment. Black 
lines indicate probes used in this study. H, H/ndlll; P, Pstl; to, £co0109 1; S. Sad; 
£, fcoRi; B, BamHI; K, Kpnl h. When multiple copies of the aPElox transgene 
are situated within a concatameric array in the mouse genome, Cre-mediated 
recombination between like-oriented loxP sites within the array removes the 
intervening DNA as a circular episome, leaving a reduced-copy array at the 
same chromosomal location. The forward (excision) reaction is heavily 
favoured over the reverse (integration) reaction, which requires recombina- 
tion between loxP sites on separate molecules. 
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Fig. 2 Copy-number determination for ocPElox parental and Cre-modified 
progeny transgenic lines. DNA was obtained from tail biopsies of three-week- 
old Gt hemizygous mice for each of the lines shown, as well as from lines bear- 
ing known copy numbers of a transgene that contains the same /acZ-ccHS-40 
cassette. After digestion with Sad, DNA was resolved on a 0.8% agarose get 
and transferred to a nylon membrane. The membrane was hybridized with 
fragment B (Fig. 1a), and the copy number was determined by quantifying the 
intensity of the resulting signal and comparing it with those of the standards. 
To ensure equivalent DNA loadings in each lane; the membrane was stripped 
and re-hybridized with a probe for the murine ct-globin gene (ma). In a sepa- 
rate Southern biot performed on tail DNA from each of these lines, hybridiza- 
tion with a probe specific for the 3' end of the transgene (fragment C Fig. 1a) 
revealed that the junction fragments between insert and mouse DNA gener- 
ated with the restriction enzymes E coRI or fcoRV for each of the reduced-copy 
progeny lines were the same as those observed in the respective parent arrays, 
indicating that the genomic location of the array had not been altered during 
Cre modification (data not shown). 




copy number 1 5-8 15 40 >100 5 1 >100 1 



ma 




erythroid cells containing an active transgene locus (Fig. 3, 
Table 1). Similar findings of variegated expression patterns in 
high-copy transgenic lines bearing a construct (odPE) that differs 
from aPElox only in the absence of the single loxP element 16 sug- 
gest that the presence of the 34-bp loxP oligonucleotide has no 
effect on transgene expression. When the number of transgene 
copies within the aPEloxl array was reduced to five by Cre- 
mediated recombination (l.cre/a), there was a large increase 
(more than 1,000-fold) in the percentage of primitive erythroid 
cells expressing the transgene. Similarly, Cre-mediated reduction 
of the 0tPJElox7 parent array to one copy (7.cre/a) also suppressed 
the variegated expression that was observed in the parent line, 
with a 180-fold increase in the size of the expressing population 
(Fig. 3, Table 1). In these two independent transgenic lines, the 
presence of high-copy multimeric arrays is therefore associated 
with a silencing of transgene expression, which is observed as a 
decrease in the percentage of cells containing an active transgene 
locus. A decrease in the number of copies within each array cor- 
relates with a suppression of variegation — that is, an increase in 
the size of the expressing population. No expression of the trans- 




gene was observed in the line l.cre/b, in which the single trans- 
gene copy was rearranged (data not shown). 

In plants, repeat-induced gene silencing has been observed at 
both the transcriptional 17 " 19 and post-transcriptional 20 ' 21 stages 
of gene expression. Tb determine whether the copy-num- 
ber-dependent silencing of transgene expression in mouse ery- 
throid cells occurs at transcription or involves a post- 
transcriptional modification, we performed run-on analysis in 
nuclei of 12.5-dpc erythrocytes from aPEloxl and its reduced- 
copy derivative, l.cre/a (Fig. 4a). Run-on transcripts from the lacZ 
reporter gene were present in l.cre/a primitive erythrocytes but 
were not detectable in erythroid cells of the high-copy parent line, 
indicating that the silencing of expression from multi-copy trans- 
gene arrays occurs at the level of transcription. The methylation 
status and local chromatin structure of the transgene locus in 
these two lines were also compared Although trans genes present 
in 12.5-dpc erythrocytes of the reduced-copy line l.cre/a appear 
urimethyiated, the high-copy aPEloxl transgene locus at the same 
genomic location was found to be heavily methylated (Fig. 4 b) . To 
analyse chromatin structure, we performed endonuclease protec- 
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Fig. 3 X-gal staining of primitive ery- 
throcytes from the aPElox transgenic 
lines and their reduced-copy progeny. 
Individual 12.5-dpc embryos from the 
transgenic lines shown were bled into 
PBS and whole blood cells were 
stained with X-gal before visualiza- 
tion under light microscopy. For 
aPEloxl, many fields of view had to 
be scanned to detect a single blue cell. 
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Table 1 • Copy number and erythroid expression 



Transgenic line 


Copy number 


Percentage of 12.5-dpc 
erythroid cells 
expressing lacZ 


aPEIoxl 


>100 


0-0.01 
<n=11) 


l.cre/a 


5 


65±9 
(n = 11) 


aPEIox7 


>100 


0.3±0.02 
(n=4) 


l.ae/a 


1 . 


54 ±9 
(n=3) 



quefitly contain high- copy concatameric arrays of a monomelic 
unit, this observation has important implications for the mean- 
ingful interpretation of transgenic experiments in mammals. The 
influence of copy number on silencing is likely to be most promi- 
nent in lines such as those studied here, in which the array is very 
large, but repeat-induced silencing may often be responsible for 
poor transgene expression. In addition to copy number, as shown 
here, the factors influencing transgene silencing in mice include 
integration site, the lineage in which expression occurs and the 
as-acting control elements within the transgene 23 . Transgene 
constructs for which copy-number-dependent expression has 
been reported 24 ' 25 may contain genetic elements that function to 
insulate individual monomers and prevent silencing. Without 
such elements, the lox/Cre system of site-specific recombination 
could be used to activate expression in high-copy lines. 

The hypermethylation of transgenes within the inactive, 
high-copy array noted here has also been reported in other cases 
of repeat-induced silencing 1 ' 3,26,27 , although it is unclear in our 
system whether methylation precedes or is a consequence of the 
observed chromatin restructuring. The occurrence of copy- 
number-dependent transgene inactivation in Drosophiloj in 
which DNA methylation has not been detected 28 , suggests that 
methylation is not necessary for the chromatin remodelling 
associated with repeat-induced silencing and that hypermethy- 
lation of the aPElox transgene in mice may be a secondary 
modification that occurs at already inactive, high-copy arrays. 
This hypothesis is consistent 'with our previous fmding that the 
variegated silencing of globin/ ZacZ transgenes in mice correlates 
with an inactive chromatin structure but not hypermethylation 
at the transgene locus 29 . 

Although a molecular mechanism for repeat-induced silencing 
of mdtimeric arrays is yet to' be fully elucidated, the correlation 
between the silenced state and the adoption of a less accessible 
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Transgene copy numbers were determined by Southern analysis (Fig. 2). The 
percentage of 12.5-dpc erythroid cells expressing lacZ was determined by 
staining cells as described (Fig.3) and scoring a minimum of 200 cells for a 
detectable blue colour. Expression data are presented as the mean ± one stan- 
dard deviation — except for aPEIoxl, where a range is given, n, number of 
individual transgenic embryos assayed for each line. 

tion assays 22 on nuclei isolated from 12.5-dpc primitive erythroid 
cells from these two lines. In their native chromatin configuration, 
transgenes in the high-copy array (aPEIoxl) were more resistant 
to endonuclease digestion at a site within the transgene promoter 
than transgenes that were at the same genomic location but within 
a lower-copy array (l.cre/a; Fig. 4c). The transcriptional silencing 
that occurs at his^h-copy arrays is therefore associated both with 
hypermethylation of transgenes in the locus and with the adop- 
tion of a repressive local chromatin configuration. 

These results establish that the presence of multiple repeats 
within a high-copy array can directly repress transgene expres- 
sion in a mammalian system. Because transgenic animals fre- 

Fig. 4 Transcription, methylation and 
chromatin accessibility of the trans- 
gene. a; Nuclear run-on analysis of 
latZ transcription in aPEIoxl and 
l.cre/a. Nuclei were prepared from 
primitive erythroid cells collected 
from 12.5-dpc transgenic embryos of 
the lines aPEIoxl and 1.cre/a. Run-on 
transcripts were synthesized and hy- 
bridized to membranes containing the 
following DNA fragments: the mouse 
a-globin gene from +17 to +956 (ma), 
the mouse £-globin gene from +150 to 
+969 (mQ and a fragment of the TacZ" 
gene, which includes nucleotides from 
+39 to +1164 relative to the start 
site of the aPElox transcript (/acZ). 
^ Analysis of methylation status of 
the transgene locus in aPEIoxl and 
l.cre/a. DNA from 12.5-dpc erythro- 
cytes of the lines aPEIoxl and 1.cre/a 
was digested with either BamHI alone 
or BamHI together with either Msp\ 
(M) or Hpatt <H), which are insensitive 
and sensitive, respectively, to methyla- 
tion at the central CpG dinucleotide of 
their common recognition sequence, 
5'-CCGG. The resulting fragments 
were analysed by Southern blotting 
and hybridization with fragment A 
(Fig. 1a). c, Endonuclease protection 
assay of the transgene locus in 
aPEIoxl and l.cre/a. Nudei prepared 
from primitive erythroid cells of the 

lines aPEIoxl or 1.cre/a were digested with increasing concentrations of £co0109 1. The DNA was then purified and digested with EcoRl to release a 5.0-kb par- 
ent fragment (composed of the tail-head junction between adjacent transgenes) before Southern analysis (top panel). When the membrane was hybridized 
with fragment A (Fig. la), deavage at the £co0109 1 site within the transgene promoter reduced this parent band to a 3.2-kb daughter fragment. The sizes of the 
expected parent and daughter fragments were determined by digestion of purified tail DNA from the line 1.cre/a with either f coRI alone (E) or both EcoRl and 
£co0109 I (E/Eo). The membrane was quantified with a Phosphorlmager and Imagequant version 4.2a software (Molecular Dynamics; bottom panel). The verti- 
cal axis shows the percentage of total signal contained within the daughter fragment at each enzyme concentration for the lines aPEIoxl (closed circles) and 
1.cre/a (open circles). 
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chromatin configuration observed both here and in Arabidopsis 19 
is consistent with a model in which homologous pairing between 
monomers within the array induces heterochromatinization at the 
transgene locus. Support for a model of heterochromatin forma- 
tion is derived from the suppression of copy-number-dependent 
silencing of pigment genes in the Drosophila eye by mutations 
within genes encoding known structural components of hete- 
rochromatin 4 . It remains to be determined whether the repeat- 
induced modification of chromatin structure in mice is dependent 
on the proximity of the locus to nearby blocks of constitutive het- 
erochromatin, as was the case for a brown eye pigment transgene 
in Drosophila?, or whether high-copy arrays autonomously form 
inactive chromatin structures irrespective of their position. The 
arrangement of endogenous loci such as the rRNA, tRNA and his- 
tone genes as high-copy concatamers of a repeated unit suggests 
that multi-copy arrays need not always be subject to repeat- 
induced silencing. An investigation of how silencing is prevented 
at these endogenous loci may prove useful in maintaining activity 
at multi-copy arrays of foreign genetic elements. 



Methods 

Transgenic mice. The aPElox transgene was constructed by insertion of an ' 
oligonucleotide containing a single copy of the 34 bp loxP site 
(5-ATAACTTCGTATAATGTATGCTA TACGAAGTTAT-3') between the 
SDK-lacZpA cassette and the 4.2-kb OtHS-40 fragment of the previously 
described construct aPE 16 . The integrity of the loxP site was confirmed by 
dideoxy sequencing. The aPElox fragment for micro-injection was excised 
from plasmid vector sequences by digestion with HindSH and Kpril and 
purified by agarose-gel electrophoresis. Transgenic mouse lines bearing 
the aPElox construct were generated by standard micro-injection tech- 
niques in the outbred Pathology Oxford (P.O.) mouse strain. For Cre 
recombination, fertilized oocytes were collected from wild-type P.CX 
females mated with hemizygous G x transgenic male mice and micro- 
injected into either pronucleus with the circular Cre-expression vector 
pCAGGS-Cre 14 at 5 ng/uJ before transfer into pseudo-pregnant foster 
mothers. Iive-bom progeny that showed a reduced transgene copy num- 



ber as determined by initial Southern blotting were mated with wild- type 
animals to establish transgenic lines hemizygous for the modified locus. 

Histology. After individual 12.5-dpc embryos had been bled into PBS, 
whole blood cells were gently pelletted and then fixed in 0.25% glutaialde- 
hyde before staining with 5-bromo-4-chloro-3-indolyl P-D-galactopyra- 
noside (X-gal) at 37 °C for 24 h as described previously 30 . 

Nuclear run-on and endonuclease protection assays. To prepare membranes 
for nuclear run-on analysis, probe fragments were excised from plasmid vec- 
tor by digestion with the appropriate restriction enzymes and purified by 
agarose-gel electrophoresis. Probe fragments used were an Hinfl-Hinfl frag- 
ment containing +17 to +956 relative to the transcription start site of the 
mouse a-globin gene; an Xbal-PsA fragment containing +150 to +969 of the 
mouse £-globin gene; and a Psfl-UcoRV fragment of pSDK-tocZpA 30 , which 
includes +39 to +1 164 relative to the start site of the lacZ transcript produced 
by aPElox. Purified fragment DNA (0-2 ug of each fragment) was then elec- 
trophoresed on 12% agarose gel and transferred to a nitrocellulose mem- 
brane by Southern blotting. Nuclei were purified as described 9 from 1.5 xlO 7 
primitive erythroid cells collected from 12.5-dpc transgenic embryos. Run- 
on transcripts were synthesized from isolated nuclei and hybridized to mem- 
branes as described previousl y 31 . For endonuclease protection assays/nuclei 
prepared as described above were separated into four aliquots of 250 ul and 
digested with 0, 40, 80 or 160 U of £cd0109 I at 37 °C for 90 mm. After pro- 
teinase-K digestion and phenol-chloroform extraction, purified DNA was 
digested with EcoBl and Southern analysis performed. 
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Achieving long-term retroviral expression in primary cells has been problematic De novo DNA methylation 
of infecting proviruses has been proposed as a major cause of this transcriptional repression. Here we report 
the development of a mouse stem cell virus (MSCV) long terminal repeat-based retroviral vector that is 
expressed in both embryonic stem (ES) cells and hematopoietic stem (US) cells. Infected HS cells and their 
differentiated descendants maintained long-term and stable retroviral expression after serial adoptive trans- 
fers. In addition, retrovirally infected ES cells showed detectable expression level of the green fluorescent 
protein (GFP). Moreover, GFP expression of integrated proviruses was maintained after in vitro differentiation 
of infected ES cells. Long-term passage of infected ES cells resulted in methylation-mediated silencing, while 
short-term expression was methylation independent. Tissues of transgenic animals, which we derived from ES 
cells carrying the MSCV-based provirus, did not express GFP. However, treatment with the demethylating 
agent 5-azadeoxycytidine reactivated the silent provirus, demonstrating that DNA methylation is involved in 
the maintenance of retroviral repression. Our results indicate that retroviral expression in ES cells is 
repressed by methylation-dependent as well as methylation-independent mechanisms- 



Retroviral vectors are appealing vehicles for gene transfer. 
However, long-term expression mediated by integrated provi- 
ruses in primary cells has been difficult to achieve. Retroviral 
regulatory elements are repressed in numerous cell types, in- 
cluding embryonic stem (ES) cells and hematopoietic stem 
(HS) cells (1, 3). For example, vectors that are functional in 
mature hematopoietic cells are often not expressed in blood 
cells of animals transplanted with the infected stem cells (18, 
19, 31). In particular, the lack of significant provirus transcrip- 
tion in ES cells and their differentiated descendants has ham- 
pered the use of retroviral vectors in transgenic experiments (5, 
12, 32). Interestingly, this block in provirus expression is main- 
tained upon differentiation of infected cells despite the fact 
that primary infection of cells after differentiation results in 
efficient expression (6, 7, 26). 

Transcriptional repression is thought to be mediated by both 
cw-acting de novo methylation of the integrated proviruses and 
cell-type-specific /rans- acting transcriptional repressors (5, 9, 
23). TTie effect of fra/w-acting factors on retroviral expression 
through binding of specific sequences within the promoters of 
retroviruses has been examined in many studies (29, 30, 35). In 
fact, the mouse stem cell virus (MSCV) long terminal repeat 
(LTR) was generated by the modification of the sequences 
within the LTR to increase the affinity for positive factors and 
decrease the affinity for negative regulators (20). 

In contrast, the role of methylation in silencing has been less 
clear. DNA methylation is thought to be a general mechanism 
used by cells to silence foreign DNA and may be involved in 
the cell defense against transposable elements (39). DNA 
methylation has also been associated with the repression of 
gene expression and the silencing of viral control elements (2, 
14, 38). Exogenously introduced retroviruses silenced in vitro 
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and in vivo can be reactivated by treatments that result in 
genomewide demethylation. In addition, transcriptionally si- 
lent endogenous retroviral elements are reactivated upon loss 
of genomic methylation in Dnmtl knockout mice (38). There- 
fore, DNA methylation is thought to causally repress expres- 
sion of retroviral promoters in a variety of cell types. 

ES cells provide a good model to study the role of DNA 
methylation in retroviral silencing. First, it was demonstrated 
that ES cells have high de novo methylation activity, which 
leads to effective methylation of integrated retroviral vectors, 
while little or no de novo methylation activity was detected in 
differentiated cells (21). In addition, ES cells were genetically 
modified to alter the endogenous level of DNA methylation by 
the targeted disruption of the maintenance methyltransferase 
gene Dnmtl. ES cells homozygous for this mutation proliferate 
normally with their genomic DNA highly demethylated, while 
differentiated cells and mice die due to the loss of genomic 
methylation (21, 22). Therefore, these modified ES cells are 
useful to study the effect of DNA methylation on retroviral 
gene expression. In addition, ES cells can be induced to dif- 
ferentiate in vitro or in vivo, allowing the study of DNA meth- 
ylation and its effect on long-term expression. 

Both Moloney virus-based and MSCV-based retroviral vec- 
tors have been used for gene transduction in a variety of cells. 
The MSCV vector is different from the typical Moloney virus 
vector in that the mutations in the LTR have allowed expres- 
sion in a larger host range (8, 20). To this end, we modified 
MSCV to express the green fluorescent protein (GFP) as a 
sensitive reporter for gene expression (37). Using this vector, 
we demonstrated efficient expression in both ES and HS cells. 
We also demonstrated that silencing of retroviruses involves 
two mechanisms: (i) tazray-acting factors that affect the initial 
expression of Moloney virus-based vectors but not MSCV- 
based vectors and (ii) long-term DNA methylation-dependent 
silencing that directly restricts expression of MSCV in ES cells 
and during embryogenesis. Silencing of the MSCV vector in 
wild-type ES cells and in in vivo differentiated ES cells was 
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reversed by 5-azadeoxycytidine (5-azadC) treatments that de- 
methylated the retroviral sequences, demonstrating that DNA 
methylation directly controls the maintenance of retroviral re- 
pression. 



MATERIALS AND METHODS 

Tissue culture. ES cells were cultured as described previously (21). To gen- 
erate ES cell clones for injection into blastocysts, the ES cells were maintained 
on irradiated mouse embryonic fibroblasts (MEFs) with 500 U of leukemia 
inhibitory factor (LIF) per ml (22). For other experiments, the ES cells were 
cultured without MEFs in 1,000 U of LIF per ml. 293 cells were maintained in 
Dulbecco's modified Eagle medium supplemented with 10% fetal bovine serum 
(FBS), penicillin, streptomycin, and glutamine. Abelson virus-transformed B 
cells were maintained in RPMI 1640 supplemented with 10% defined FBS 
(HyClone), penicillin, streptomycin, glutamine, and 50 u,M p-mercaptoethanol. 
ES cells with retroviral integrants were in vitro differentiated as follows: the cells 
were passaged without LIF in the absence of MEFs on bacterial plastic petri 
dishes for 4 days, trypsinized, and cultivated with or without retinoic acid for 2 
weeks (25). 

Plasmids. The retroviral vectors MfgGFP, pMXGFP, and MSCViresGFP 
have been described elsewhere (27, 33, 37). The MSCViresGFP vector was 
modified by introducing either the Cre recombinase or the human Bcl-2 gene 
upstream of the internal ribosome entry site (IRES)-GFP cassette as described 
elsewhere (11, 37). The replication-incompetent helper plasmid pCL-eco was 
used (24). 

Retroviral infections. To generate retroviral supematants, 293 cells were tran- 
siently transfected by calcium phosphate-mediated coprecipitation with 5 u,g of 
the replication-incompetent helper vector pCL-eco and 10 \ig of the reporter 
retroviral vector as stated elsewhere (28). The cells were fed at 24 h postinfec- 
tion, and the retroviral supernatant was used at 48 h. The cells continued to 
produce high-titer retroviruses for another 2 days, and that supernatant was used 
if needed for additional experiments. The supernatant was collected, brought to 
4 u,g of Polybrene per ml-10 roM HEPES, and filtered (0.45-u.m-pore-size filter) 
for use. 

ES cells for infection were washed and trypsinized. They were then plated at 
10 6 cells per well of a six-well dish and centrifuged. The ES cell medium was 
removed, and retroviral supernatant was added at 1 ml/10 6 cells. Next, the plate 
was centrifuged for 45 min at 2,500 rpm at room temperature. The retroviral 
supematants were removed; the cells were resuspended in ES cell medium and 
plated onto gelatinized dishes. ES cells used to generate mice were plated onto 
irradiated MEFs. 

Bone marrow was infected as follows (36). C57BL/6 mice were obtained from 
the Jackson Laboratory (Bar Harbor, Maine). Bone marrow cells were harvested 
from the tibias and femurs of C57BL mice 5 days after they received an intra- 
peritoneal injection of 5 mg of 5-fluorouracil (Sigma) in Dulbecco's phosphate- 
buffered saline (Gibco/BRL). These cells were then cultured for 4 days at 2 X 10 6 
cells/ml with recombinant mouse interleukin-3 (rmIL-3; 20 ng/ml), rmIL-6 (50 
ng/ml), and (50 ng/ml; recombinant mouse stem cell factor; R&D Systems) in 
Dulbecco's modified Eagle medium containing 10% FBS. After 48 and 72 h, the 
bone marrow cells were spin infected with the retroviral supernatant generated 
as described above. Then the retroviral supernatant was removed and replaced 
with growth medium containing cytokines. 

FACS (fluorescence-activated cell sorting) analysis and sorting. Adherent cell 
lines were trypsinized, washed, and resuspended in complete medium to achieve 
a single-cell suspension at the time points indicated. Nonadherent cells were 
used directly for analysis. Organs were disrupted manually and passed through a 
70-um mesh to generate a single^ell suspension. The cells were analyzed for 
viability using scatter properties and the exclusion of propidium iodide. The level 
of GFP expression was monitored by fluorescence without compensation to 
detect cells with low levels of GFP expression. The ES cells were sorted into ES 
cell medium and plated immediately onto either gelatinized plates or MEFs for 
blastocyst injections. The survival of ES cells after sorting was approximately 
50%, as measured by the number of colonies generated divided by the expected 
number of colonies. 

5-AzadC treatments. ES cells were treated with 0.15 u.M 5-azadC (Sigma) at 
days 1 and 3 postplating. The cells were fed, allowed to recover, and then assayed 
4 to 8 days later. The red blood cells in whole blood were lysed (5), and the 
remaining cells were stained with fluorescently labeled anti-H2-b, anti-H2-d, 
anti-B220, anti-TCRa (Pharmingen) at 1:200 as indicated. At day 0, splenocytes 
were treated with either anti-CD3 or anti-CD40 (Pharmingen); 0.15 u»M 5-azadC 
was added at day 1, and the anti-CD3-treated cells were assayed at day 4. 
5-AzadC was added again to the B-cell cultures with fresh anti-CD40 at day 4, 
and the cells were assayed at day 6. 

Staurosporine-mediated cell death. ES cells were infected with the stated 
retrovirus and treated with staurosporine at day 4 postinfection for 24 h with the 
indicated concentration of drug. The percentage of viable, GFP-positive cells was 
determined by flow cytometry (6). Data are presented as a percentage of GFP- 
positive cells before treatment Results from one representative experiment of 
three performed are shown. 



LacZ staining. ES cells were infected with the stated retrovirus and sorted for 
GFP expression at day 3 postinfection. The ES cells were plated and cultured for 
an additional 5 days and stained for LacZ expression as described elsewhere (41). 

Adoptive transfers. Recipient mice (10) received a total of 1,200 rads of 
whole-body radiation in two doses (800 and 400 rads) 3 h apart and were then 
injected with 2 X 10* to 5 X 10 6 infected bone marrow cells. Irradiated mice were 
maintained on trimethaprim-sulfamethoxazole in sterile cages for 4 to 6 weeks to 
prevent opportunistic infections (34). Serial passages were performed by har- 
vesting bone marrow from mice 6 to 8 weeks postreconstitution and transferring 
2 X 10* to 5 X 10* cells into irradiated recipients. Mice were analyzed 8 to 12 
weeks posttransfer to allow reconstitution of the T-cell compartment. These 
experiments were repeated multiple times with similar results. 

Southern blot analysis. The genomic DNA was isolated as described else- 
where (19). Ten micrograms of DNA was digested with the stated restriction 
enzyme overnight. The products were resolved on an agarose gel, transferred to 
a nylon membrane, and detected using a probe that spans the entire GFP coding 
sequence. 

RESULTS 

High-efficiency retroviral expression in ES cells. Retroviral 
vectors based on the MSCV LTR were constructed with a 
multiple cloning site followed by an IRES driving expression of 
the gene for GFP as schematically diagrammed in Fig. 1A 
(MiG) (37). We generated high-titer retroviruses by transient 
transfection and infected ES cells with an adapted spin infec- 
tion protocol. Using this protocol, we reproducibly achieved 
high-efficiency (>50%) infection of ES cells as measured by 
flow cytometry; uninfected control cells were negative for GFP 
expression (Fig. IB). The intracellular concentration of GFP is 
directly proportional to the fluorescence intensity measured by 
flow cytometry. 

Next, we compared expression of the MSCV-based retrovi- 
rus and Moloney virus-based retroviral vectors in ES cells. 
GFP expression was detectable with the MSCV LTR-contain- 
ing MiG vector but not with the two Moloney virus-based 
viruses pMX (27) and Mfg (33) (Fig, IB). This was not due to 
inefficient genomic integration of the provirus or to a lower 
titer. Southern blot analysis of genomic DNA demonstrated 
that all three proviruses were integrated in the ES cells (Fig. 
1C). Also, when parallel B-cell cultures were infected with the 
retroviral supernatant used to infect ES cells, all of the retro- 
viral vectors were expressed in B cells at comparable efficien- 
cies (Fig. IB). 

GFP expression driven by the MSCV LTR in ES cells was 
substantially lower than in other differentiated cell lines tested 
(Fig. IB and data not shown) (20). To determine whether this 
low level of expression was sufficient to drive functional ex- 
pression of other gene products, we cloned the Cre recombi- 
nase upstream of the IRES-GFP cassette to generate 
MSCVCreiresGFP. We tested for Cre activity by infecting ES 
cells that contain a translational stop sequence flanked by loxP 
sites located between the Rosa 26 promoter and a lacZ re- 
porter schematically diagrammed in Fig. 2A (34). If Cre is 
expressed at functional levels in these ES cells, the protein will 
catalyze recombination of the loxP sites, leading to loss of the 
stop sequences and the expression of LacZ. Indeed, we found 
that >99% of GFP-positive ES cells that were infected with 
the Cre-expressing retrovirus were also LacZ positive (Fig. 
2B). Uninfected cells were both GFP negative and LacZ neg- 
ative (data not shown). This indicates that virus-mediated gene 
transfer resulted in functional Cre expression. 

Because Cre activity is required only transiently for LacZ 
expression, we tested a second gene product that must be 
stably expressed throughout the experiment. It has been dem- 
onstrated that Bcl-2 expression protects many cell types against 
staurosporine-mediated apoptosis (10). Therefore, we exam- 
ined whether Bcl-2 could protect ES cells from apoptosis when 
expressed from the MSCV LTR. We cloned human Bcl-2 up- 
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FIG. 1. Efficient retroviral infection of ES cells. (A) Schematic diagram of 
MiG vector containing the MSCV LTR followed by a multiple cloning site 
(MCS) and an IRES-GFP cassette. (B) MSCV-based (MiG) but not Moloney 
virus-based (Mfg and pMX) retroviruses express in ES cells. B cells or ES cells 
were infected by the indicated retroviruses and assayed by flow cytometry 2 days 
postinfection. Uninfected cells (unshaded) and infected cells (shaded area) were 
electronically gated for live cells and subsequently analyzed for GFP fluores- 
cence and for cell number. Percentages of GFP-positive cells are indicated. (C) 
Comparable levels of integration of different retroviruses into ES cells, deter- 
mined by Southern blot analysis of genomic DNA purified from infected and 
uninfected ES cells 2 days postinfection, digested with KpiiU a restriction site 
present within the LTRs, and probed with the GFP coding sequence. 



stream of the IRES-GFP cassette to generate MSCV Bel- 
2iresGFP. Wild-type ES cells were infected with either the 
BcJ-2-expressing retrovirus or the control virus lacking Bcl-2. 
Increasing concentrations of staurosporine were added to the 
cultures, and flow cytometry was used to assay for both viability 
and GFP expression. GFP-positive cells infected with the Bcl- 
2-containing virus were significantly protected from staurospo- 
rine-mediated cell death compared to the GFP-negative cells 
or GFP-positive cells infected with the control retrovirus (Fig. 
2C). Therefore, the level of expression from the MSCV LTR is 
sufficient for stable functional gene expression in ES cells. 

Short-term transcriptional silencing in ES cells is methyl- 
ation independent It long has been hypothesized that retrovi- 
ruses are transcriptionally silenced in embryonic cells by DNA 
methylation (12, 14, 21). Therefore, it was possible that DNA 
methylation of the MSCV LTR was responsible for the de- 
creased level of expression in ES cells compared to other cell 
types (Fig. IB). In addition, we sought to test whether DNA 
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2. Expression from the MSCV LTR is sufficient to drive functional gene 
expression. (A) Schematic diagram of the Rosa 26 locus in Cre reporter ES cells. 
Before Cre-mediated recombination, LacZ expression is prevented by the pres- 
ence of a stop fragment. Retroviral infection with a Cre-expressing retrovirus 
with a GFP reporter results in two populations of cells. Cells that are GFP + 
become LacZ + due to efficient Cre-mediated recombination of the stop frag- 
ment. In contrast, cells that are GFP' were not infected and thus remained 
LacZ". (B) ES cells were infected with the MSCVCreiresGFP retrovirus and 
sorted for either Gfp - or Gfp + as indicated. The cells were subsequently cul- 
tured and stained for LacZ expression. Gfp~ cells are white (and therefore 
LacZ~) while Gfp + cells are blue (and therefore LacZ" 1 "). More than 99% of the 
Gfp + cells were LacZ" 1 " in multiple experiments. (C) ES cells were infected with 
either MSCViresGFP (■) or MSCVBcl-2iresGFP (□) and treated with the 
indicated amounts of staurosporine. The percentage of viable, Gfp + (infected) 
cells was determined by flow cytometry. The results are shown as a percentage of 
Gfp + cells before treatment. The results are from one representative experiment 
of three performed. 



methylation of the Moloney vims-based vectors in the wild- 
type ES cells was the mechanism by which the Moloney virus- 
based LTRs were silenced (9, 13). To this end, we infected ES 
cells deficient for the maintenance DNA methyltransferase 
gene, Dnmtl, the loss of which results in genomewide hypo- 
methylation (21, 22). Dnmtr l ~ ES cells are demethylated, and 
proviral sequences remain unmethylated. The Moloney virus- 
based retroviruses such as pMX remained silent even when 
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FIG. 3, Long-term expression of retroviruses is repressed by methyl alion. (A) MSCV-based (MiG) but not Moloney virus-based (pMX) retroviruses express in ES 
cells independent of the methylation status of the cells. Dnmil H * or Dnmtl - '' ES cells were not infected (unshaded) or infected by the indicated retroviruses (shaded) 
and assayed by flow cytometry 2 days postinfection as for Fig. IB. (B) Long-term expression of GFP in ES cells is decreased in Dnmtl* /+ cells but not Dnmtl~'~ cells. 
The ES cells were infected with MiG, passaged for 5 or 26 days postinfection, and assayed by flow cytometry as above. The mean fluorescent intensity for the population 
and the percentage of GFP-positive cells are indicated. (C) Treatment with 5-azadC rescues the expression of retroviruses silenced by long-term passage. Dnmtl +,+ 
or Dnmtl' 1 ' ES cells were infected by MiG and passaged for >40 days. The cells were divided, and half were treated with 5-azadC. Then uninfected ES cells 
(unshaded), MiG-infected untreated ES cells (dark shading), and MiG-infected 5-azadC-treated ES cells (light shading) were assayed by FACS analysis. Numbers below 
the FACS plots are percentages of GFP-positive cells before and after 5-azadC treatment. (D) In vitro differentiation of ES cells does not affect retroviral expression. 
A clonal ES cell line (R7) infected with MiG or an uninfected wild-type (WT) ES cell control was in vitro differentiated by passage without feeders and LIF, with or 
without retinoic acid (RA) as indicated. The cells were assayed by flow cytometry, and the percentage of GFP-positive cells is indicated. 



introduced into Dnmtl ~'~ ES cells, whereas MSCV expressed 
similar levels of GFP in both Dnmtl and Dnmtl ~'~ ES cells 
(Fig. 3A). Therefore, the initial block in transcription directed 
by Moloney virus LTRs in ES cells is independent of DNA 
methylation and is presumably due to the binding of trans- 
acting factors. In addition, the mean fluorescence intensities of 



GFP were comparable between the Dnmtl* /+ and Dnmtl' 1 ~ 
ES cells, indicating that the basal level of expression of the 
MSCV LTR is independent of DNA methylation. 

DNA methylation constrains long-term retroviral expression. 
MiG-infected GFP-expressing ES cells were continually passaged 
to test the effect of DNA methylation on long-term expression. 
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Though GFP expression was high in both Dnmtl~ f ~ and 
Dnmtl +/+ ES cells at 5 days postinfection, a substantial frac- 
tion of the infected wild-type ES were GFP negative at 26 days 
postinfection. This was apparent by both a loss in the percent- 
age of GFP-positive cells as well as a decrease in the mean 
fluorescence intensity of the bulk population of wild-type ES 
cells and was observed in both bulk cultures and individual 
cloned lines containing single integrants (Fig. 3B and data not 
shown). The fraction of GFP-positive cells continues to de- 
crease with additional passages, as shown in Fig. 3C. These 
results suggest that long-term expression was suppressed by 
DNA methylation. To directly test whether retroviral repres- 
sion was due to de novo methylation of the newly integrated 
retroviruses, we treated the long-term cultures with 5-azadC, a 
drug that leads to hypomethylation of genomic DNA (16). If 
DNA methylation was preventing expression of the MSCV 
LTR, treatment with the drug should activate retroviral ex- 
pression. Indeed, we found that 5-azadC treatment of ES cells 
that had lost expression of GFP through long-term passage 
reactivated the provirus (Fig. 3C). In contrast, Dnmtl~ J ~ ES 
cells infected with the retrovirus did not lose expression of 
GFP; thus, treatment with 5-azadC did not significantly affect 
retroviral expression (Fig. 3C). We also analyzed clonal lines 
containing single proviral integrants in which GFP expression 
was progressively silenced and found that treatment with 
5-azadC resulted in the reactivation of gene expression in all 
cases (data not shown). This demonstrates that DNA methyl- 
ation controls long-term but not short-term expression of ret- 
roviruses in ES cells. 

Expression is maintained after in vitro differentiation. Pre- 
viously, in vitro differentiation of ES cells had been demon- 
strated to silence expression of retroviral sequences (12, 20). 
Thus, we tested whether GFP expression from the MiG retro- 
virus in ES cells was affected by in vitro differentiation. We 
cultured MiG-infected wild-type ES cells in the absence of 
embryonic feeder cells and LIF in suspension to generate em- 
bryoid bodies. Disaggregated embryoid bodies were replated 
either with or without retinoic acid. We found no change in 
GFP expression in MiG-infected bulk cultures or individual 
subclones containing one to several integrants upon in vitro 
differentiation with either method, as shown for one clonal line 
containing multiple integrants in Fig. 3D. GFP expression was 
unchanged in all in vitro-differentiated ES cell lines, regardless 
of whether the subclones contained only a single or multiple 
integrants. This indicates that the MSCV-based MiG retrovi- 
rus is not silenced by in vitro differentiation. 

Generation of mice from GFP-expressing MiG-infected ES 
cells. We next determined whether expression of the MSCV- 
based MiG vector was affected by in vivo differentiation of the 
infected ES cells. Cells from the chimeric animals were derived 
by injection of MiG-infected wild-type ES cells (derived from 
129-Sv/Jae mice) into BALB/c blastocysts. MiG-infected 
Dnmtl~^ ES cells cannot be used for injection into blasto- 
cysts, because Dnmtl - ^ ES cells die upon differention and 
therefore do not contribute significantly to adult mice (22). 
MiG-infected wild-type ES cells were sorted for GFP expres- 
sion by flow cytometry prior to injection, and two GFP-express- 
ing clones, R2 and Rll, were isolated (Fig. 4B). Southern blot 
analysis demonstrated that R2 contained two integrants that 
comigrate on an agarose gel, and Rll contained three proviral 
integrants (Fig. 4A). High-contribution chimeras (>80% by 
coat color) were generated from the R2 and Rll ES cells, 
which transmitted the proviruses to their offspring (data not 
shown). 

To test whether the chimeras expressed the integrated ret- 
roviruses, we isolated peripheral blood mononuclear cells 



RETROVIRAL EXPRESSION IN ES AND HS CELLS 7423 

A R2 R11 




B 

R2 R11 




FIG. 4. Retrovirally infected, GFP-expressing ES cells generated nonex- ^ 
pressing mice. (A) Two retrovirally infected clones sorted for GFP expression cr 
were analyzed for proviral integrants by Southern blot analysis. R2 contained two ? 
integrants, while Rll contained three. Uninfected cells are negative. (B) The ^ 
clones were passaged after sorting for GFP-expressing cells by flow cytometry ~ 
and reanalyzed for GFP expression. Both R2 and Rll express GFP (shaded) o 
compared to uninfected controls (unshaded). (C) PBMCs from the R2 chimera <o 
(more than 50% contribution by coat color) were analyzed by flow cytometry. ES 
cell contribution to the chimera was determined by phycoerythrin-H2-b staining 
and cyc-TCRa staining and demonstrated contribution to the T-cell compart- 
ment The percentage of cells in each quadrant is listed. The cells were also 
monitored for GFP expression. The percentage of GFP* cells that are either 
major histocompatibility complex (MHC) class I H2-b + (ES cell derived) or 
H2-b~ (blastocyst derived) is listed in the quadrant. 



(PBMCs) from both the R2- and Rll-derived chimeras. To 
distinguish whether the PBMCs were derived from the ES cell 
donor or the host blastocyst, we stained the cells with antibod- 
ies that recognized specific major histocompatibility complex 
class I haplotypes (Pharmingen). The donor ES cells (129 
derived) are H2-b, and the blastocysts (BALB/C derived) are 
H2-d (Fig. 4C and data not shown). In addition, we stained the 
PBMCs with a pan-T-cell (TCRa) (Fig. 4C) or pan-B-cell 
(B220) antibody (data not shown) to determine the ES cell 
contribution to these lineages. Using this strategy, we found 
that approximately 90% of the PBMCs from either the R2 or 
Rll chimera were ES cell derived as measured by H2-b stain- 
ing (Fig. 4C, data not shown). However, the majority of the 
cells did not express GFP in either chimera (Fig. 4C and data 
not shown). On the order of 0.1% of the PBMCs that were ES 
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cell derived were GFP positive, compared to less than 0.01% 
that were blastocyst derived (Fig. 4C). Similar results were also 
obtained with cells from the Rll chimera (data not shown). 
The results indicate that the MSCV LTR is repressed during in 
vivo differentiation to lymphocytes. Nevertheless, a small num- 
ber of cells escaped silencing and expressed GFP. This tran- 
scriptional repression of the MiG provirus in the chimeras is in 
contrast to the GFP expression both in the donor ES and after 
in vitro differentiation (Fig. 4B, data not shown). 

To determine if other somatic cells expressed the retroviral 
integrants, we analyzed the progeny of the chimeras. We iso- 
lated spleen, thymus, kidney, and liver cells from an animal 
carrying the two proviral integrants present in the R2 chimera 
and a littermate control containing no retroviral integrants. We 
analyzed these cells for GFP expression by FACS analysis and 
found no detectable expression of GFP in the splenocytes, 
thymocytes, renal cells, or hepatocyes (Fig. 5A and data not 
shown). 

In vitro reactivation of retroviral expression. One possible 
explanation for transcriptional repression during in vivo differ- 
entiation was de novo methylation of the integrated retroviral 
LTR during embryonic development. To test this hypothesis, 
we cultured splenocytes from a mouse containing the R2 pro- 
viruses and from a littermate control, by treating the cells with 
either anti-CD3 or anti-CD40 to activate and induce prolifer- 
ation of the T cells or B cells, respectively (4). We then assayed 
for GFP expression by flow cytometry and found that prolif- 
eration of the splenocytes did not activate expression of the 
retrovirus (data not shown). Next, we added 5-azadC to the 
splenocyte cultures to induce demethylation of the retrovi- 
ruses. Indeed, treatment with 5-azadC activated expression in 
approximately 2% of the T cells (anti-CD3) (Fig. 5B) and 2% 
of the B cells (anti-CD40) (data not shown). In addition, when 
in vivo-differentiated cells, which had been isolated from the 
kidney of a transgenic mouse and transformed with simian 
virus 40 large T antigen (15), were treated with 5-azadC, acti- 
vation of the silent provirus was observed in a similar fraction 
of the cells (data not shown). The extent of reactivation of 
expression of the provirus in in vivo-differentiated cells by 
5-azadC was lower than in ES cells, where the reactivation of 
the provirus with 5-azadC was almost complete. 

We next detennined whether demethylation of the retrovi- 
rus in vivo would activate expression of the integrated retrovi- 
ruses (13). Newborn mice were subcutaneously injected with 
5-azadC at postnatal day 5 and subsequently analyzed at post- 
natal day 14 for GFP expression by flow cytometry. We found 
that 5-azadC-injected animals but not the uninjected controls 
had activated GFP expression of the proviruses in the spleen, 
thymus, and kidney (Fig. 5C and data not shown). When we 
injected higher concentration of 5-azadC in an effort to further 
demethylate the newborn mice, all injected animals died. This 
result demonstrated that repression by DNA methylation is, at 
least in part, responsible for silencing expression of the retro- 
viral UTR in vivo. 

Retroviral expression in HS cells after serial adoptive trans- 
fers. Bone marrow contains the HS cells that can stably repop- 
ulate the hematopoietic system after transfer to lethally irra- 
diated mice. To determine whether HS cells can be effectively 
transduced and express the MiG retrovirus, we used infected 
bone marrow cells to reconstitute lethally irradiated mice (Fig. 
6). We found that between 30 and 80% of the splenocytes from 
these primary recipients expressed the retrovirus, as measured 
by FACS analysis for GFP expression and shown for one rep- 
resentative experiment (Fig. 6A). The MiG virus was expressed 
in the B-cell, T-cell, and granulocyte compartments, as mea- 
sured by a pan-B cell (B220), pan-T-cell (Thy-1), and pan- 
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FIG. 5. Silenced retroviruses can be reactivated with 5-azadC. (A) Spleno- 
cytes from an R2 + or R2~ littermate do not express GFP by flow cytometry. 
Cells were stained with propidium iodide to exclude dead cells, and the percent- 
age of GFP + cells is indicated. (B) The splenocytes from panel B were induced 
to proliferate with anti-CD3 and treated with 5-azadC. Cells were stained with 
propidium iodide to exclude dead cells and analyzed by flow cytometiy. The 
percentage of GFP + cells is indicated. (C) Flow cytometric analysis of the 
splenocytes of littermates that were either uninjected or injected with 5-azadC at 
passage 5 and analyzed at passage 14 for GFP expression. The percentage of 
GFP + splenocytes is indicated. 



granulocyte (Gr-1) marker electronically gated on GFP-posi- 
tive cells (Fig. 6B and data not shown). Because a large 
fraction of the splenocytes in the primary recipients are de- 
rived from relatively differentiated, lineage-committed progen- 
itors, serial adoptive transfers are required to test for retroviral 
expression in the true HS cells (17). Therefore, we used bone 
marrow from these primary recipients to serially reconstitute 
lethally irradiated mice. This protocol requires substantial ex- 
pansion from the stem cells and tests for long-term expres- 
sion of the retrovirus. We observed no change in the per- 
centage GFP-positive HS cells, and the level of GFP 
expression from the adoptive transfers into multiple recip- 
ients was stable over three additional passages (4° recipi- 
ent). In addition, the infected cells gave rise to both B- and 
T-cell lineages at the expected ratios (Fig. 6B), demonstrating 
not only that the MiG retrovirus transduced the long-term 
repopulating HS cells but also that the MiG-mediated GFP 
expression was stable during in vivo hematopoietic differenti- 
ation. However, our results do not exclude the possibility that 
in addition to the transcriptionally active proviruses present 
within these cells, there are also copies of the virus that were 
transcriptionally silenced. 
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FIG. 6. Serial adoptive transfers maintain expression of the MSCV-based 
retrovirus. (A) Bone marrow was infected with MiG and used to reconstitute 
multiple lethally irradiated mice to generate the 1° recipient. The spleen of the 
1° recipient was analyzed for GFP expression by flow cytometry. Hie bone 
marrow of the 1° recipient was used to reconstitute lethally irradiated 2° recip- 
ients. The spleen of a 2° recipient was analyzed for GFP expression, and the bone 
marrow was used to reconstitute lethally irradiated 3° recipients. The spleen of 
a 3° recipient was analyzed for GFP expression, and the bone marrow was used 
to reconstitute lethally irradiated 4° recipients. A representative analysis is 
shown. (B) splenocytes from pane] A, stained with pan-B-cell (B220) and pan- 
T-cell (Thy-1) antibodies and electronically gated for GFP + cells, are shown 
below the GFP histogram they are derived from. The FACS diagrams are shown 
for these serially reconstituted spleens, demonstrating that the transferred cells 
contribute to both B- and T-cell lineages in the appropriate ratios. 



DISCUSSION 

We have investigated the role of DNA methylation in ret- 
roviral silencing. Retrovirus-based studies of stem cells have 
been hampered by the lack of expression. We have overcome 
the transcriptional repression in ES cells by using an MSCV- 
based vector in combination with a sensitive GFP reporter 
gene (MiG vector). The analysis of expression of the MiG 
vector and other Moloney virus-based vectors in Dnmtl~ ! ~ 
and Dnmtl +/+ ES cells has allowed us to determine whether 
DNA methylation directly controls retroviral gene expression 
in these cells. We found that both methylation-dependent and 
methylation-independent mechanisms exist to control retrovi- 
ral gene expression. 

Historically, retroviral expression of Moloney virus-based 
vectors in ES cells has been negligible. In contrast, the MSCV 
LTR not only transduces GFP expression in ES cells but also 
expresses other exogenous gene products such as the Cre re- 
combinase and the antiapoptotic factor Bcl-2 at detectable 
level in ES cells. Therefore, the MSCV LTR can be used to 
express various transgenes in ES cells and their differentiated 
descendant cells. 

It had been proposed that DNA methylation has evolved as 
a cellular mechanism to silence retroviral elements, preventing 
the spread of transposable elements through the genome (39). 
Indeed, de novo methylation of integrated proviral sequences 
has been observed in wild-type ES cells, which was correlated 
with the transcriptional silencing of the retrovirus (14). Our 
findings are the first demonstration that inhibition of the 
Dnmtl methyltransferase gene prevents silencing of the retro- 
viruses in ES cells. This result provides direct evidence that 
DNA methylation is causally involved in long-term retroviral 
repression. Consistent with this conclusion is the demonstra- 



tion that the transcriptionally silenced proviruses present in 
long-term Dnrntl^ ES cell cultures can be reversed by treat- 
ments with 5-azadC. 

In contrast, methylation-independent mechanisms deter- 
mine initial retroviral expression in ES cells. Wild-type or 
Dnmtl ES cells infected with Moloney virus-based vectors 
were transcriptionally silent, and therefore this silencing was 
independent of the DNA methylation status of the cells. More- 
over, the basal level of expression from the MSCV-based vec- 
tor was unaffected by the methylation status of the cells. This 
formally demonstrates that DNA methylation-independent 
mechanisms control initial retroviral gene expression in ES 
cells. Because the basal level of expression of the MSCV LTR 
in ES cells is lower than in differentiated cell types and not 
affected by the methylation status of the ES cells, trans-acting 
factors must regulate the initial level of expression. 

Previous studies found that retroviruses, including the 
MSCV LTR, are silenced by the in vitro differentiation process 
(20). In contrast, we found for the first time that expression of 
this MSCV-based retrovirus in ES cells was maintained after in 
vitro differentiation with and without retinoic acid. We were 
also able to show long-term, stable GFP expression from the 
MiG vector in HS cells and their differentiated derivatives. 
MiG-mediated GFP expression from HS cells was stable 
through serial adoptive transfers, and the HS cells gave rise to 
GFP-expressing B- and T-cell lineages. Therefore, this MSCV- 
based retroviral transduction system should allow for a molec- 
ular analysis of stem cell biology and differentiation programs 
by forced expression of exogenous gene products. 

It has been postulated that methylation-dependent mecha- 
nisms repress retroviral gene expression upon in vivo differen- 
tiation (13, 20). To test this, we injected GFP-expressing un- 
differentiated ES cells into recipient blastocysts and generated 
chimeric mice. Differentiated tissues derived from these in 
vivo-differentiated ES cells, such as PBMCs, lacked significant 
GFP expression. Treatment of ES cell-derived differentiated 
cells with 5-azadC in vitro or in vivo led to partial reactivation 
of expression of the silenced retroviruses in lymphoid and 
nonlymphoid tissues. We conclude from these results that the 
maintenance of retroviral silencing in vivo involves DNA meth- 
ylation. However, only a small fraction of the 5-azadC-treated 
cells reactivated GFP expression, unlike the long-term ES cell 
cultures, in which every cell reactivated GFP expression. This 
suggests that methylation-independent mechanisms exist to 
suppress retroviral expression. Alternatively, 5-azadC treat- 
ment of differentiated cells, in contrast to ES cells, may not 
lead to a level of genomic demethylation sufficient for com- 
plete retroviral reactivation. The transgenic animals carrying 
the silenced MiG proviruses will be a valuable indicator for in 
vivo activation of GFP expression under different conditions. 
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SUMMARY 

Transgenic mouse lines were established bearing tandem arrays of a fusion construct comprising the promoter region of 
a housekeeping gcnb^HMGCR, encoding 3-hydroxy 3-methylgiutaryl CoA reductase, linked to a bacterial cat reporter gene 
encoding chloramphenicol acetyltransferase (CAT). CAT activity was observed in all transgenic mouse tissues examined. 
The methylation state of the fusion transgene was determined. In non-transgenic mice the endogenous HMGCR promoter 
is devoid of methylation while flanking regions are extensively modified. In HMGCR-cat transgenic mice the fusion gene 
promoter was found to be similarly hypomethylated. However, the extent of hypomethylation varied with copy number: 
methylation-free states was progressively lost with increasing transgene copy number. Further transgenic mouse lines were 
constructed carrying' a truncated HMGCR regulat6ry region linked to cat. Transgene expression and hypomethylation were 
observed in testis but not in any other tissue, and testis-specific methylation-free status was also lost at high copy number. 
Loss of hypomethylation at high copy number may indicate that saturable DNA-binding factors normally protect the 
HMGCR promoter from methylation. 



INTRODUCTION * 

In contrast to~"tne largely methylation-free genomes of 
bacteria and invertebrates, vertebrate genomes have exten- 
sive 5-methylation of C residues within CpG doublets 
(Grippo et al., 1968), and under-representation of CpG has 
been attributed to spontaneous deamination of 5-methyl-C 
(Bird, 1980). Th e relative m sensitivhy of yertebrateDNA to . 
cleavage with enzymes whose recognition sequence 
contains a CpG doublet, for instance ffpaU (CCGG), is a 

Correspondence to: Dr. R. Lathe, CAGR, King's Buildings, West Mains 
*oad, Edinburgh EH9 3JQ (UK.) TeL 31 6684541; Fax 31 6670164. 

Abbreviations: bp, base pair(s); CAT, Cm acetyltransferase; cat, gene 
Coding CAT; Cm, chloramphenicol; HMGCR, 3-hydroxy-3-methyi- 
CoA reductase; HMGCR, gene encoding HMGCR; HTF, HpaU 
S fragment; kb, kflobase(s) or 1000 bp; nt, nucleotide(s). 



consequence both of the rarity of CpG doublets and of 
cleavage inhibition by methylation of the recognition 
sequence. However, a small fraction of the genome is effi- 
ciently cleaved by HpaU (Cooper et al., 1983) and such 
regions, 'fTpfln-tiny-fragment' (HTF) islands, rich in un- 
methylated CpG doublets, appear to be associated with 
actively transcribed genes (Lavia et al., 1987) and particu- 
larly^th the prompter, regions, of Mu^keeping_genes_ 
{Bird, 1986; Gardiner-Garden and Frommer, 1987). Pro- 
moter regions of ubiquitously-expressed genes are unusual 
in that they are G/C-rich and lack consensus transcription 
initiation signals such as the TATA' or 'CAAT boxes, and 
ubiquitous expression has been attributed, to hypomethyla- 
tion of control regions, (reviewed by Bird, 1986; 1987; 
Dynan, 1986; Gardiner-Garden and Frommer, 1987; 
Cedar, 1988). Methylation can correlate with diminished 
expression in vivo and in vitro (reviewed by Cedar, 1988); 
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however, the causal link between hypomethylation and gene 
expression is unclear. 

HMGCR (3-hydroxy-3-methylglutaryl CoA reductase) is 
a membrane-bound glycoprotein that catalyses a key step 
in the synthesis of cholesterol, an essential component of 
the cell membrane (Brown and Goldstein, 1980; Luskey, 
1986). The HMGCR genes of hamster and human are char- 
acterized by a noncoding first exon and a long (ca. 3.5 kb) 
. jntervening3.equence prior to thejtranslation_s.tart site in the 
second exon (Reynolds etal., 1985; Luskey, 1987). We 
recently isolated the homologous mouse HMGCR gene and 
the promoter region, as for the hamster and human genes, 
bears the hallmarks of a HTF island in high G/C content 
and a CpG/GpC ratio close to unity (M.M. and R.L., 
unpublished). 

We have used the transgenic mouse system to explore 
HTF island hypomethylation. Pronuclear microinjection of 
DNA into fertilized mouse eggs is now an established tech- 
nique for generating mice bearing new gene combinations. 
In the majority of cases the injected DNA integrates at a 
single site as a multiple tandem repeat, and transgene copy 
number varies considerably between different transgenic 
lines (reviewed by Palrniter and Brinster, 1986). To explore 
possible variation in transgene functional status with copy 
number, two different fusion genes between the HMGCR 
promoter region and a cat reporter gene were constructed 
and introduced into the mouse germline. Employing these 
transgenic animals we endeavoured to study the relation- 
ship between transgene copy number, expression, and 
hypomethylation of the transgene promoter. 

RESULTS AND DISCUSSION 

(a) Transgenic animals bearing the HMGl-cat fusion con- 
struct 

Construct HMGl-cat comprises a 5.5-kb BamHI frag- 
ment of the mouse HMGCR promoter region, containing 
1.35 kb of upstream sequence, the first (noncoding) exon 
and the first intron, linked, to cat (Fig. IA). Hie. fusion 
construct was injected into fertilized mouse eggs and lines 
of transgenic mice were obtained carrying between 10 and 
260 tandem copies of the transgene. These anim als all 
express the fusion transgene in all tissues examined 
irrespective of transgenic line (Table I; also not presented): 
no proportionality was observed between expression level 
and copy number (M.M. and R.L., unpublished) as 
recorded (Palrniter and Brinster, 1986) in other transgenic 
systems. 

(b) Methylation status of the HMGl-cat transgene 

To assess the in vivo methylation status of the HMGCR 
promoter, we measured the extent to which template 
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Fig. 1. Structure and analysis of the HMGCR-cat fusion transgene. 
(A) Structure of the injected construct HMGl-cat. The first HMGCR 
exon (noncoding) and the untranslated 5 ' portion of the HMGCR second 
exon are indicated by open boxes ; car, hatched; the DNA sequence S V-A | 
downstream from cat contains SV4& splice-andpolyadenylation- signals 
derived from plasmid pSV2-CAT (Gorman et aL, 1982). (B) Transgene 
restriction map. Above, restriction sites are B, Bam HI; B ° , a Bam HI site 
destroyed during the cloning procedure; P, PvwIE, E, EcoKL, S, Smal; 
flflnVi-n g AfrifT sites (N) used for excision- of the DNA construct prior to 
rnicTOinjection into fertilized mouse eggs are derived from the plasmid 
vector. Behw t HpaHjMspl sites; the group of sites beneath the PvuU site 
(P) is a cluster of seven sites over a region of 225 bp (spacing: 20, 25, 80, 
30, 50 and 20 nt; unpublished data). (C) Hybridization probes employed 
to determine methylation status; the probe segment employed in the 
Southern analysis of Fig. 2 is marked with an asterisk. (D) DNA seg- , 
ments analysed for methylation inhibition of excision in Fig. 3. 
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Fig. 2. Representative Southern analysis of HpallfMspl digests. liver 
DNA was prepared by a standard proteinase K/SDS/phenol procedure; 
10 jig aliquots were digested with excess Mspl (m) or HpaU (h), resolved 
by 1% agarose gel electrophoresis, and subjected to Southern analy^s 
using a radiolabeled probe (marked with an asterisk in Fig. 1C) design 
to detect the 400-bp segment 2 (arrowed) in Fig. ID. Animals were- 
(a) wild-type; (b) transgenic line 74; (c) line 40; (d) line 78. The Eg** 
bands of cleavage-resistant DNA are above the highest M T rotf** 
employed (21.5 kb; not shown). 
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modification was able to inhibit excision of different trans- 
gene DNA segments from the genome. Total liver DNA 
from representative animals of different transgenic lines 
wa s cleaved either with Hpall or with the methylation- 
iflsensitive isoschizomer Mspl, and after gel electrophoresis 
and Southern blotting examined for hybridization to sepa- 
rate probes (Fig. 1C) covering different regions of the 
jfldGCR gene. The extent of methylation within each seg- 
ment (Fig. ID) of the transgene was assessed by the ratio 
of band intensities in the Hpall and Mspl lanes. A typical 
experimental result is presented in Fig. 2 (zone analysed: 
Segment 2 in Fig. ID); data obtained for segments covering 
the entire transgene are compiled in Fig. 3. 

Whereas excision of segments within the endogenous 
BMGCR promoter (two copies per diploid genome) in nor- 



mal mice was not detectably blocked by methylation 
(<2%), flanking regions were resistant to digestion with 
Hpall (Fig. 3a). In transgenic mice carrying ten copies of 
the HMGl-cat construct the HMGCR transgene promoter 
remained demethylated although the region devoid of 
methylation was narrower than that of the HMGCR gene in 
normal mice (Fig. 3b). However, methylation of the trans- 
gene promoter was observed to increase progressively in 
animals carrying 22 (35% methylation-iiihlbition of seg- 



bfAOCR 



car SV-A 



j± \ promoter 



1 Kb 



100- 



50 



1111 



2 3 



8 9' 10' 



C 

o 

B. CO 

c S 

0 <D 

>. © 

CD Ol 

1 S 

£ o 
a 

a> 
a 




lllllll 



2 3/ 



llll 



p 



11« 



II 



4 5 6 7 8 9 10 11 



Will™ 



It 



a 1 1 II 111 



e 



3 4 5 6 T 8 9 10 11 

segment number 
fig. 3. Compilation of methylation data. Representative animals carry- 
ing different transgene copy numbers were: (a) wild type (2 copies of the 
HMGCR promoter); (b) transgenic line 74 carrying ten copies of the 
HMGl-cat fusion transgene); (c) line 40, 22 copies; (d) line 80, 45 copies; 
(e) line 78, 260 copies. Vertical axis: the extent of methylation inhibition 
of segment excision, determined as the ratio of the intensities of the 
relevant bands in the Hpall and Mspl lanes of liver DNA (e.g, Fig. 2). 
Unshaded areas ab ove certain histo gram bars indic ate 'greater-than* 
values. Horizontal axis: segment number (see Fig. ID); rnternally- 
kbelled probes used to deterniine methylation status (Mehtah, 1988) are 
Presented in Fig. 1C Quantitative scanning densitometry of autoradio- 
frajus (GS300 Scanner, Hoeffer Scientific) was used to determine the 
%an/Af.jpIexcisionratio. Segment 1 comprises two equally-sized Hpall 
fragments that were not resolved by gel electrophoresis; segment 10 
^Ups an HpaU 'slow* site within cat that is partially refractory to 
tytafl cleavage. Segments 9 and 10 represent the next two HpaU frag- 
ments present in the endogenous HMGCR gene but not in the HMGl-cat 
tl ^gene. For technical reasons the excision ratio for segment a3 wes not 
det *nriined {nd). 



B 



NB° E S SP°B° E 

1 1 ' i m i 



N 



Hpall sites 




860 ^ 



420 ^ 



Copy N° 11 170 

Fig. 4. Methylation state of the HMG2-cat transgene. (A) Structure of 
HMG2-cat: this constru ct is a d erivative of HMGl-cat, i n whic h the region 
between the PvuTL site in the HMGCR first exon and the site marked B ° 
at the 5' end of cat in HMGl-cat (see Fig. 1A,B) has been removed. 
(B) Restriction site nomenclature is as in Fig. IB; the solid bars given 
below indicate the segments detected by. the hybridization probe 
employed (DNA fragment comprising cat and the SV4Q splice and poly- 
adenylation signals, not shown). (Panels C) Southern hybridization of 
Mspl (m) and Hpall (h) digests of total liver (Ii) and testis (Te) DNA 
from representative animals bearing eleven (line 61) or 170 copies (line 
76) of the fusion transgene. HpaU fragments detected (sizes in bp) are 
arrowed High-i/ r band sizes and all experimental conditions are as in 
Fig, 2 legend. 
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TABLE I 



CAT activity in tissues of HMGCR-cat transgenic mice 



Construct* 


line 


Copy No. b 


CAT activity d in mouse tissues c 
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HMGl-cat 




.10 


12 
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21 
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62 


31 


59 


HMGl-cat 


40 e 


22 


17 


20 


107 
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90. 


72 


na 


HMGl-cat 




22 


30 


90 


95 


15 


80 


110 


140 


HMGl-cat' 


' 8.0* 


— 45 ' 




32 


'21 " 
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70 


23 


na 


HMGl-cat 


78 c 


260 


0.6 


0.5 


2 


0.6 


0.6 


0.5 


32 


HMGl-cat 


61 ' 


11 


0.6 


0.9 


0.7 


. 0.6 


0.7 


0.8 


135 


HMG2-cat 


76 f 


170 


0.5 


1.6 


0.7 


0.6 


0.7 


0.8 


208 



a Construct -pHMGl-cat contains a 5.5-kb JBamBl segment of the mouse HMGCR gene (Mehtali, 1988) comprising the mouse HMGCR promoter region, 
the first (untranslated) exon, and the first intron, linked to the bacterial cat gene (Fig. 1 A) and propagated in pla smid pPol ylH-i (Lat he et aL , 1987). The 
downstream Bam HI site used in this construction lies immediately beyond the HMGCR exon II splice acceptor site (Mehtali, 1988; Gautier et aL, 1989; 
M.M. and R.L., in preparation). In pHMG2-cat the region between the PvuU site in the HMGCR first exon and the BamHJ° site at the 5 ' end of col 
(see Fig. 1 A) was removed: the structure of the fusion gene is presented in Fig. 4. Plasmid DNAs were propagated on£. cofi 1 1 06 (thr leu tkiksdS). HMG-cai 
fusion genes were excised from the vector by Aforl digestion and punned by sucrose gradient centrifugation (10-30% sucrose, 35 000 rpm, Beckman SW41 
rotor, 14 h, 20°C). Two hundred copies were injected into fertilized eggs (C57B1/6 x SJLF1 hybrids) and the presence of the transgene was detected 
by Southern blot analysis of DNA from the tails of four-week-old animals (Palmiter et aL, 1982). Transgenic lines were established by systematic 
back-crossing with C57B 1/6 x SJLF1 hybrids. 

b Copy numbers were determined by quantitative densitometry of Southern blots; all transgenic animals carry the transgene as an unre arranged tandem 

repeat integrated at a single autosomal location (data not presented). 

c Tissues were: ta, tail; K, liver; in, intestine; ki, kidney; br, brain; st, stomach; te, testis. 

d CAT activity in tissue homogenates (ultra-turrax, in 0.25 M Tris-HCL pH 7.8, 1 mM phenylmethyrsulfonylfluoride) was determined, after heat treatment 
(10 min, 65 °C) and clarification, by the transfer of radioactivity from 14 C acetyl CoA to ethyl-acetate-soluble Cm in a standard assay (Sleigh, 1986) at 
a constant protein concentration as measured using a commercial assay kit (BioRad), and is expressed as the incorporation of 14 C acetyl into Cm 
(cpm x 10 3 ) per 100 fig protein. The background value for CAT activity in negative extracts was 0.5. na, not applicable, 

e Animals used for analysis in Fig. .2 (founder transgenic male of line 78 ; founder transgenic females of Knes 40 and 80, and a second generation male 
for line 74; the remaining line 40 animal was a second generation male; M.M and R.L., in preparation). 
f Animals used for analysis were second generation males. 



ment excision, Fig. 3c), 45 (48%) or 260 copies (78%) 
(Fig. 3d,e). The methylation pattern appeared identical in 
other tissues examined (intestine, testis) though the extent 
of the hypomethylated region was slightly larger in testis 
than in the other tissues (data not presented). The strength 
of the hybridization signal originating from the transgene 
unfortunately precluded examination of the methylation 
pattern of the endogenous HMGCR gene in these transgenic 
mice. 

(c) HMG2-cat transgenic mice 

To determine whether there is a general correlation 
between increased transgene copy number and loss of 
methylation-free status we constructed a deletion derivative 
of the HMGl-cat construct, HMG2-cat, in which a sub- 
fragment of the HMGCR promoter region is linked directly 
to cat (Fig. 4A). HMG2-cat was introduced into the mouse 
germ line and tissues from transgenic anim als were analysed 
for CAT activity (Table I). In the two lines examined (61 
and 67; Table I) activity was detected in testis but not in 
other tissues. The methylation state of the fusion transgene 
in lines 6 1 and 76 was examined by HpaU or Mspl digestion 



and Southern hybridization to probes designed to detect the 
reporter gene segment of the transgene (Fig. 4B) or the 
HMGCR promoter region (data not presented). 

In liver (Fig. 4C) and intestine (data not presented), 
tissues in which no expression is observed (Table I), the 
fusion transgene was extensively methylated in both trans- 
genic lines, bf testis of mice harboring eleven copies of the 
transgene (line 61), a tissue in which the transgene is 
expressed, the HMG2-cat promoter and surrounding 
regions were essentially devoid of methylation (Fig*4C^ 
also data not presented). In contrast, the transgene in testis 7 
of line 76 (harboring 170 copies) was found to be sub- 
stantially methylated (Fig. 4C), supporting a link between 
increased copy number and loss of methylation-free status. 
Nevertheless, in testis of line 76 a small proportion of 
transgene copies appeared to be devoid of methylation; 
these few unmethylated' copies may be responsible for the 
observed transgene expression level. Indistinguishable 
methylation patterns were obtained in all cases irrespective 
of whether the probe employed covered the reporter gefl e 
segment (Ffe. 4C) or the HMGCR promoter region (data 
not presented). It is of note that hypomethylation of the 
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gMOCR promoter region in testis of line 61 now extends 
^to the adjacent bacterial cat gene (Fig. 4). Similar hypo- 
m ethylation of cat was observed in testis of a further trans- 
genic mouse line (No 60) harboring 15 copies of the HMG2 

^cit transgene (data not presented), 

■ ■ 

(d) Conclusions 

HTF islands are associated with control regions of active 
genes, particularly the housekeeping genes (Bird, 1986; 
j^avia et al., 1987; Gardiner-Garden and Frommer, 1987). 
It was previously reported that the methylation-free HTF 
Island of the Thy-l gene is maintained when the intact gene 
is introduced into the mouse germ line (Kolsto et al., 1986). 
We describe here that hypomethylation of a housekeeping 
gene {HMGCR) promoter is dependent upon transgene 
copy number, and methylation-free status of the H TF island 
at the 5 ' end of a HMGCR-cat fusion transgene, HMGl-cat, 
is lost at increasing copy-number. In further transgenic mice 
bearing the HMG2-cat deletion derivative of the fusion con- 
struct, CAT activity and transgene hypomethylation were 
only detected in testis. It is of note that ectopic expression 
of tissue- specific transgenes in testis has been observed 
previously (Lacy et>al., 1983; Shani, 1986; Al-Shawi et al., 
1988; and our unpublished data). As observed in HMGCR 
-cat transgenic animals, methylation-free status in testis of 
HMG2-cat transgenic .animals was also lost at high copy 
number. 

One possible explanation for loss of hypomethylation at 
high copy number is/out-titration of regulatory proteins or 
other binding factors that protect the DNA from methyla- 
tion. Our data do not exclude the possibility that out- 
titration of a demethylase activity (e.g., Razin et al., 1986) 
might also reduce methylation level. However, methylation 
of the transgene promoter at high copy number demon- 
strates that HTF island DNA is not intrinsically resistant, 
in vivo, to^Anethylation of C residues within CpG doublets. 
Because saturable factors thus appear to protect DNA from 
met hylatio n in vi vo, factor bind ing seems likely to precede 
methylation. In consequence, it would appear unlikely that 
the methylation status of the HMGCR promoter itself plays 
a major role in determining the extent of factor binding in 
vivo. 

Despite the fact that the HMGCR promoter present on 
merransgene can outnumber-the endogenous HMG€R-^o^ 
moter by a factor of 100, we have not detected any altera- 
tion in the expression of the endogenous gene (M.M., R.L. 
and G. Boukamei, unpublished data), in agreement with the 
conclusions of a study (Davis and MacDonald, 1988) using 
a rat elastase I transgene. Because high transgene copy 
number appears to be without effect on the expression of the 
endogenous HMGCR gene, binding factors may be only 
bcally out-titrated, possibly arguing for restricted diffusion 
of binding factors (see Richetti et al., 1988). 



Although we report saturable . hypomethylation of the 
HMGCR promoter, this result may contrast with the report 
of Kolsto et al. (1986) who describe hypomethylation of a 
hybrid Thy-1 gene promoter in mice bearing 60 copies of the 
transgene. However, the presence of extraneous sequences 
in Thy-1 transgenic mice (Grosveld and Kollias, 1988) com- 
plicates interpretation. It therefore remains unclear whether 
loss of methylation-free status and/or local out-titration of 
binding factors is ageneral feature of large transgene arrays. 

We alsb report that the hypomethylated region of the 
HMGCR-HTF island can extend into adjacent bacterial 
DNA (cat). This phenomenon was only observed when the 
cat reporter gene was linked directly to the HMGCR pro- 
moter region (construct HMG2-cat) and not when the cat 
segment was separated from the HMGCR regulatory region 
by 4 kb of intervening HMGCR genomic DNA (construct 
HMGl-cat). Hypomethylation of adjacent bacterial DNA 
was only observed in testis, the only tissue in which trans- 
gene expression was detected, and we speculate that 
cooperative binding of factors (e.g., Phillips et al., 1989; see 
also Murray and Grosveld, 1 987) to the HMGCR promoter 
and to adjacent CpG-rich bacterial DNA may be responsi- 
ble. We cannot however exclude the possibility that tran- 
scription per se can contribute to under-methylation. 

Taken together, our data argue that the cytosine- 
methylase passively methylates DNA according to its 
accessibihty/affinity for the methylase. Because DNA 
methylation can inhibit gene activity, passive methylation of 
transcriptionally inactive regions may contribute to the 
repressed state. Lower eukaryotes and invertebrates lack 
detectable DNA methylation, and the large genome sizes of 
vertebrates and plants may provide a selective advantage 
for DNA methylation (Antequera and Bird, 1988), for 
instance by marking untranscribed and passively-methyl- 
ated DNA for higher-order condensation. 
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Summary 

Expression and DNA methylation of the Moloney 
murine leukemia virus (M-MuLV) genome were in- 
vestigated in murine teratocarcinoma cells after 
virus infection. The newly acquired viral genome 
was devoid of methylation, yet its expression was 
repressed. The Integrated viral genome In undiffer- 
entiated teratocarcinoma cells was methylated 
within 15 days after infection. Although 5-azacyti- 
dine decreased the level of DNA methylation, it did 
not activate M-MuLV in undifferentiated cells. Acti- 
vation by 5-azacytidine occurred only in differen- 
tiated, teratocarcinoma cells. Thus two independent 
mechanisms seem to regulate gene expression dur- 
ing the course of differentiation. The first mecha- 
nism operates in undifferentiated cells to block 
expression of M-MuLV and other exogeneously ac- 
quired viral genes, such as SV40 and polyoma virus, 
and does not depend on DNA methylation. The 
second mechanism relates only to differentiated 
cells and represses expression of genes in which 
DNA is methylated. 

Introduction 

Laboratory strains of mice as well as field isolates of 
Mus species carry a large number of endogenous 
'type C virus genomes. Blot hybridization analysis of 
mouse DNA digested with various restriction enzymes 
suggested that ceils of inbred mice have as many as 
50 copies of DNA sequences related to type C viruses 
(Dolberg et al., 1981). Inducible murine endogenous 
viruses are classified into two host range groups, 
ecotropic viruses and xenotropic viruses. Chromo- 
somal locations of ecotropic viruses can be deter- 
mined by classical Mendelian genetics. Ecotropic vi- 
rus lock of AKR mice, Akv1 and Akv2, have been 
mapped on chromosome 7 (Rowe et al., 1972) and 
on chromosome 1 6 (Kozak and Rowe, 1 980), respec- 
tively. The Cv locus, an ecotropic virus locus of C3H/ 
HeJ and BALB/c strains was detected on chromo- 
some 5 (Kozak and Rowe, 1979; Ihle et al., 1979). 
C3H/Fg mice have an ecotropic virus locus op chro- 
mosome 7 at a site distinct from the Akv1 locus. C58 
and B10-BR mice carry unnamed ecotropic virus locjL— 
on chromosome 8 and on chromosome 1 1 , respec- 
tively. Thus the distribution of endogenous ecotropic 



virus loci over a wide range of mouse chromosomes 
suggests that these endogenous virus genomes may 
have recently derived from a common prototype virus 
through germiine integration during establishment of 
laboratory mice. Actually, Jaenisch demonstrated that 
infection of preimplantation stage BALB/1 29 mouse 
embryos with M-MuLV leads to integration of the virus 
into the germ line, in an endogenous unexpressed 
state (Jaenishch, 1976). 

Infection of mouse embryo fibroblasts with murine 
ecotropic viruses . usually results in stable integration 
of the viral genome and subsequent production of 
progeny virus. This makes for a clear distinction be- 
tweeiriTbrd^^ 

stage embryos where infection with exogenous vi- 
ruses results in repression of the integrated virus 
genomes. Murine teratocarcinoma cells are the malig- 
nant and pluripotent stem cells derived from carci- 
noma of preimplantation stage embryos (Lehman et 
al., 1 974). Teratocarcinoma stem cells infected with 
ecotropic murine type C viruses do not produce prog- 
eny virus (Peries et al., 1977; Teich et al., 1977; 
Huebner et al., 1979; Gautsch, 1980). The infecting 
viral genome, however, is carried in these cells and 
can be activated with bromodeoxyuridine (BrdUrd) 
(Speers et al., 1980). 

The expression of endogenous virus genomes in 
avian and mouse embryo fibroblasts is regulated by 
DNA modification and can be activated by treating the 
cells with 5-azaCytidine (5-AzaCyd), a potent inhibitor 
of DNA methylation (Groudine et al., 1 981 ; Niwa and 
Sugahara, 1 981 ). We studied the state of the M-MuLV 
genome in teratocarcinoma cells and found that the 
viral genome in undifferentiated stem cells is re- 
pressed not by DNA methylation but by some other 
mechanism. Only after differentiation of the cells is 
the viral sequence under the control of a mechanism 
regulated by DNA modification. 

Results 

Repression of M-MuLV in Undifferentiated 
Teratocarcinoma Cells, and Induction of Its 
Expression by BrdUrd and Retinoic Acid 
EC-A1 cells derived from PCC4 cells, a pluripotent 
stem cell line of mouse teratocarcinoma cells, were 
infected with M-MuLV. Cells were treated for 24 hr 
with BrdUrd and/or retinoic acid, a compound that 
induces differentiation of teratocarcinoma cells 
(Strickland and Mahdavt; 1 978) before or after virus 
infection. Since both BrdUrd and retinoic acid exerted 
potent cytotoxic effects on EC-A1 cells; the virus- 
producing cells in the drug-treated cultures may be 
preferentially eliminated to yield a false negative re- 
sult. Thus EC-A1 cells were treated with BrdUrd and/ 
" or retinoic acid before or after virus infection', and the 
cells were then cocultivated with SC-1 cells. The SC- 
1 - cells were passaged onpe and assayed for virus 
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expression. The results in Table 1 indicate that prein- 
fection and postinfection treatment with BrdUrd alone 
was effective for productive infection of the virus. 
Preinfection treatment with retinoic acid made the EC- 
A1 cells susceptible to infection with M-MuLV. The 
expression of the virus in the BrdUrd-treated cells was 
enhanced by the presence of retinoic acid. Although 
less efficient, postinfection treatment of the cells with 
BrdUrd together with retinoic acid did induce expres- 
sion of the virus. A low but definite production of M- 
MuLV was detected by amplification through SC-1 
cells when the EC-A1 cells were treated for 24 hr with 
0.3-1 M of retinoic acid alone, immediately after 
infection. 

Infection of F-9 cells, a nullipotent teratocarcinoma 
cell line (Bernstine et al., 1973), with M-MuLV also 
required preinfection or postinfection treatment with 
BrdUrd for efficient expression of the virus (O. Niwa, 
unpublished observation). 

The teratocarcinoma stem cells, EC-A1 and F-9 
cells, were all derived from 1 29 mice that contained 
the ecotropic virus sequences and were mostly sub*-' 
genomic in size (Chan et al., 1980). These have been 
classified "as a no virus strain (Chattopadhyay et al., 
1974). The virus which was recovered from BrdUrd- 
treated EC-A1 cells, and which had been infected with 
M-MuLVVgrew equally well on NIH/3T3 and BALB/ 
3T3 cells. The tropism of the virus and the lack of 
virus activation. from uninfected EC-A1 cells (O. Niwa, 
unpublished observation) strongly suggest that the 
virus thus recovered from infected EC-A1 cells after 
BrdUrd treatment is M-MuLV and not the virus endog- 
enous to EC-A1 cells. 

Isolation of EC-A1 (Mo) Clones Carrying M-MuLV 
Genome in a Represse d State - . 
EC-A1 cells were Infected with M"Mutv^"anTfo1 of 
1 .0, which had been determined by titration on SC-1 
cells. The cultures were then trypsinized, and the cells 
were plated for . ring cloning. Each of the randomly 
isolated clones was treated with 20 /ig/ml of BrdUrd 
and cocultivated with SC-1 cells. SC-1 cells were 
passaged three times and tested for virus expression 
by the reverse XC test. Of 1 1 0 clones thus tested, 47 
expressed virus after BrdUrd treatment. These virus- 
inducible clones, designated EC-A1(Mo) clones, 
therefore carry the M-MuLV genome, in a repressed 
form. 

Transcriptional Control of M-MuLV Expression in 
Undifferentiated Teratocarcinoma Cells 
Liquid hybridization experiments indicated no detect- 
able viral RNA transcript in M-MuLV-infected terato- 
carcinoma cells (T eich et al., 1 977). We have isolated 
total cellular RNA from one of the .vims-carrying 
clones, EC-A1(Mo)4. RNA was size-differentiated on 
agarose gel electrophoresis, transferred to diazoben- 
zyloxymethyl paper (DBM paper) and analyzed for the 



Table 1. Plaque-Forming Cells per'2 x 10" SC-1 Cells Cocultivated 
with Cells Treated Preinfection and Postinfection with BrdUrd 

and/or Retinoic Acid 



BrdUrd * Retinoic Acid Preinfection Treatment/ 

(l«g/ml) QtM) Postinfection Treatment 



0 


0 


0/0 


0 


0.03 


5/0 


0 


0.1 


14/0 


0 


0.3 


23/5 


0 


1.0 


46/3 


5 


P 


122/7 


5 


0.03 


634/7 


5 


0.1 


TMTC/25 


5 


0.3 


TMTC/31 


5 


1.0 


TMTC/23 


10 


0 


TMTC/6 


10 


0.03 


TMTC/15 


10 


0.1 


TMTC/43 


10 


0.3 


TMTC/22 


10 


1.0 


TMTC/19 


20 


0 


TMTC/1 2 


20 


0.03 


TMTC/21 


20 


0.1 


TMTC/48 


20 


0.3 


TMTC/39 


20 


1.0 


TMTC/51 



TMTC: too numerous to count. 



sequence hybridizable to 32 P-labeled M4/luLV DNA 
(Figure 1). SC-1 cells productively infected with M- 
MuLV had two major bands, 34S and 24S, corre- 
sponding to virus genomic RNA and spliced env gene 
messenger RNA. On the other hand, uninfected SC-1 
cells, EC-A1 cells and EC-A1(Mo)4 cells contained no 
detectable level of RNA sequence hybridizable to M- 
MuLV DNA. The same filter was hybridized with an 
18S rRNA probe. Three bands corresponding to 18S 
rRNA and two precursor RNAs were detected in all 
four RNA samples. The density of the bands revealed 
by this probe varied little among the four cell lines 
tested, indicating that the amounts of RNA used for 
the analysis were the* same for these cell lines. Total 
cellular RNA isolated from EC-A1 cells 48 hr after 
infection with M-MuLV was analyzed by the dot blot 
hybridization technique (data not shown). Here too, 
the RNA transcript of M-MuLV was not detected. 

5-AzaCyd Induction of M-MuLV Gene Expression 
and the State of Differentiation of 
Teratocarcinoma Cells 

We attempted to induce M-MuLV expression in. undif- 
ferentiated EC-A1 (Mo) clones by 5-AzaCyd. To our 
surprise, none of the 47 EC-A1 (Mo) clones expressed 
virus after treatment with 2 jug/ml 5-AzaCyd, although 
virus was readily recovered when the clones were 
treated with BrdUrd. 

To determine whether or not the DNA methylation 
was suppressed by 5-AzaCyd in undifferentiated ter- 
atocarcinoma cells, two clones, EC-A1(Mo)1 and EC- 
A1(Mo)4, were treated with 5-AzaCyd, and the level 
•of DNA methylation was measured. As is clear from 
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Figure 1 . Blot Hybridization of Total Cellular RNA Isolated from Four 
Cell Lines 

Probes used were 32 P-labeled M-MuLV DNA Oanes a, b, c and d) and 
32 P-iabeled cloned rRNA gene (lanes e, L f; g and h). (Lanes a and e) 
RNA from SC-1 cells; Oanes b and 0 RNA from SC-1 cells infected 
with M-MuLV; Oanes c and g) RNA from EC-A1 cells; (lanes d and h) 
RNA from EC-A1 (MoM cells. 

Figure 2, The level of methylcytosine in cellular DNA 
decreased with increasing doses of 6-AzaCyd. There- 
fore, in undifferentiated teratocarcinoma cells, sup- 
pression of DNA methylation does not seem to result 
In activation of the viral genome. 

EC-A1(Mo)4 cells were treated with dimethylace- 
tamide for 2 weeks to induce differentiation, and such 
was assessed to be complete by the morphology. M- 
MuLV was never expressed in the differentiated EC- 
A1 (Mo)4 cells. These cells were then treated with 5- 
AzaCyd and cocultivated with SC-1 cells. The SC-1 
cells were passaged twice, and the reverse XC test 
was performed at each passage. Only in the differ- 
entiated EC-A1(Mo)4 cells was the 5-AzaCyd-acti- 
, vated expression of M-MuLV evident (Table 2). There- 
fore, at least in the differentiated EC-A1(Mo)4 cells, 
the viral genome seems to be regulated by the extent 
of DNA methylation, and suppression of DNA methyl- 
ation activates expression of the virus. 

Lack of DNA Methylation of Unintegrated M-MuLV 
Genome 

The data presented above indicate that'the M-MuLV 
genome in the differentiated teratocarcinoma cells is 
methylated. Experiments were then designed to de- 
termine the timing of DNA methylation of the M-MuLV 
genome, after virus infection of the undifferentiated 
cells. 

EC-A1 cells were infected with M-MuLV, and DNA 
was extracted from the Hirt supernatant fraction, which 
contained unintegrated viral genome. DNA isolated 
from EC-A1 cells 6 hr after infection contained three, 
molecular species of M-MuLV— namely, closed cir- 
cular, linear and open circular DNAs (Figure 3, lane 




i - ' 

l . -J 1— 1 L 

0 1.0 , . 2.0 

5-Azacytldlne cone, (ug/ml) 

Figure 2. Level of Methylcytosine in Newly Replicated DNA of EC- 
A1 Cells 

Cells were seeded onto 3 cm dishes at a concentration of 1 x 10* 
cells per dish. After overnight incubation, the dishes were nourished 
with medium containing (methyl- 3 H)methtonlne at 4 /iCI/ml and (2- 
14 C)thymidins at 0.005 /tCi/mt. DNA was collected, hydrolyzed and 
separated on a cellulose thin-layer glass plate. The ratio of 3 H and 
"C counts in methylcytosine and thymidine respectively was taken 
as a relative measure of methylcytosine content (O) EC-A1(Mo)1 
cells; CAJEOAKMoH cells. 



Table 2. Plaque-Forming Cells per 2 x 10 s SC-1 Cells Cocultivated 
with EC-A1(Mo>4 Cells Treated with 5-AzaCyd 



Concentration of First Passage/ 
Cells 5-AzaCyd (jig/ml) Second Passage 



EC-A1£lVloH cells 






undifferentiated 


2 


o/o ' ■* 




4 


0/0 




6 


. 0/0 




e 


0/0 


EC-A1(Mo>4 cells 






differentiated 


1 


0/0 




2 


2/31 




4 " 


13/TMTC • 


•TMTC: too numerou 


s to count 





a). Closed circular ON A and open circular DMA* con- 
sisted of two subbands differing slightly in size. A 
linear molecule had a single band of 8.8 kb. Upon 
cleavage with Hind III, two bands corresponding to 
8.2 kb and 8.8 kb linear molecules were detected 
-(Figure 3, lane b) and were assumed to be the full— 
sized M-MuLV genome with one and two long terminal 
repeats (LTRs). All of these sequences were com- 



Cell 
1108 



open circular ^ 

linear 8.8 kb I 
8.2 kb - i 

: 
I 

closed circular ^ [. 




'*sf 



Figure 3. Blot Hybridization ot Unintegrated M-MuLV Genome in 
Undifferentiated EC-A1 Cells 

EC-A1 cells were infected with M-MuLV. DNA was extracted from the 
Hlrt supernatant fraction of EC-A1 cells 6 hr aftar Infection with M- 
MuLV (lanes a, b, c and d), or 1 2 hr after infection (lanes e, f, g and 
h). The probe used was ^P-labeled M-MuLV DNA. (Lanes a and e) 
undigested DNA; (lanes b and f) digestion with Hind 111; (lanes c and 
g> digestion with Hap II; (lanes d and f) digestion with Msp I. 

pletely* digested by Hpa H and Msp I, suggesting that 
they are devoid of methylation at the CCGG sequence 
(Figure 3, lanes c, d). Similar results were obtained 
with DNA isolated at 1 2 hr after infection (Figure 3, 
lanes e, f, g, h). However, one difference in the 1 2 hr 
sample was that this DNA. lacked a closed circular 
molecule (Figure 3, lane e). It is of interest that the 
Hirt supernatant DNA isolated 24 hr after infection 
had a much lesser amount of viral DNA, while the 48 
hr sample was devoid of M-MuLV sequences (data 
not shown) In the case-of^SG-i-cells-.-DNA-extracted 
24 hr after infection had the greatest amount of unin- 
tegrated M-MuLV provirus sequence (0. N'rwa, unpub- 
lished observation). 

Undermethylation of Freshly Integrated M-MuLV 
Genome and Its Subsequent Methylation during 
Multiple Cell Cycling of Undifferentiated Cells 

EC-AI cells were infected with M-MuLV at a moi of 2, 
and DNA was isolated 48 hr later. The integration of 
viral genome occurs at random sites. Digestion of 
DNA from randomly infected cells with Eco Rl t which 
does not cleave the M-MuLV genome, will produce 
multiple fragments of various sizes carrying integrated 
viral sequences flanked by cellular sequences, and 
these cannot be resolved by agarose gel electropho- 
resis. Therefore, we used Bam HI, which cut the M- 
MuLV genome internally. As a probe to detect viral 
genome, the 316 base Sma I fragment of M-MulV 
DNA was labeled with 32 P. This probe allows for de- 
tection of the 3 kb internal fragment of Bam Hl-di- 
gested M-MuLV together with other sequences de- 
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Figure 4. Blot Hybridization of Newly Integrated M-MuLV DNA In 
Undifferentiated EC-A1 Cells 

ONA was Isolated from unlfected EC-A1 cells (lane a, b and c), EC- 
Ai(Mc}4 cells (lane d, e and f) or EC-A1 cells 48 hr after infection 
with M-MuLV (lanes g, h'and 0- The probe used was the 32 P-labeIed 
Sma 1 fragment of M-MuLV DNA. (Lanes a, d and g) digestion with 
Bam HI; (lanes b, e and hj digestion with Bam HI plusHap II; (lanes 
c, f and i) digestion with Bam HI plus Msp I. 

rived from endogenous viruses (Figure 4 f lanes a, d, 
g). Uninfected EC-A1 cells lacked the 3 kb M-MuLV- 
specific sequence (Figure 4, lane a). DNA of EC-A1 
. cells 48 hr after infection contained the 3 kb band 
(Figure 4, lane g), and this fragment was not derived 
from the unintegrated M-MuLV, as unintegrated pro- 
viral DNA disappeared from the Hirt supernatant frac- 
tion of EC-A1 cells before 48 hr of Infection. Double 
-digestion of 48 "hr postinfection DNA with Hap-ii -and 
with Msp I completely eliminated the 3 kb Bam HI 
band, suggesting that this part of the M-MuLV genome 
is devoid of DNA methylation (Figure 4, lanes h, i). 
DNA was isolated from EC-A1(Mo)4 cells that had 
undergone multiple cell cycling after virus infection. 
The M-MuLV-specific 3 kb Bam HI band was clearly 
demonstrable in the cellular DNA (Figure 4, lane d). 
However, the same sequence was now resistant to 
Hap II digestion (Figure 4, lane e). Other bands derived 
from endogenous virus genomes were also resistant 
to Hap II (Figure 4, lanes b, e, h). DNAs from another 
EC-A1 (Mo) clone, EC-A1(Mo)22, and randomly in- 
fected EC-A1 cells passaged at least 20 times also 
contained the Hpa ll-resistant 3 kb Bam HI fragment 
specific to M-MuLV. 

Experiments were performed to determine the time 
of the DNA methylation of M-MuLV genome after 
infection. DNA was isolated from M-MuLV-infected 
EC-A1 cells on days 4, 1 0 and 1 5 after infection, and 
analyzed for the state of DNA methylation (Figure 5). 
The M-MuLV genome was devoid of methylation at 
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Hap II sites up to day 10 (Figure 5, lanes b, e). 
However, the virai genome was resistant to Hap II 
when it was isolated from 1 5-day-old cultures. There- 
fore, the integrated M-MuLV genome was methylated 
between days 10 and 15 in culture. Production of M- 
MuLV from. infected EC-A1 cells during these 1 0 days 
was nil. 

These results indicate that although unintegrated 
and newly integrated M-MuLV genomes were devoid 
of DNA methylation, the same sequence is methylated 
in the undifferentiated cells kept in culture for over 1 5 
days. 

Lack of Change in the State of DNA Methylation of. 
the M-MuLV Genome during Differentiation of 
Teratocarcinoma Cells 

DNA was isolated from undifferentiated EC-A1(Mo)4 
cells and dimethyiacetamide-induced differentiated 
EC-A1(Mo)4 cells. These DNAs contained the 3 kb 
Bam HI fragment of the integrated M-MuLV genome, 
as well as other bands derived from endogenous virus 
genomes (Figure 6). The 3 kb fragment of M-MuLV in 
undifferentiated EC-A1 (Mo)4 cells was again resistant 
to digestion with Hap II, confirming the result in Figure 
4, lane'e TFTgure 6, lane b). Similar resistance to Hap 
II enzyme was noted when the DNA.from differentiated 
EC-A1(Mo)4 cells was analyzed. Therefore, the pat- 
tern of DNA methylation of at least the 3 kb Bam HI 
fragment did not change during differentiation of the 
cells, yet inducibility of the viral genome by 5-AzaCyd 
treatment changed drastically. 

Transfection with DNA from M-MuLV Infected 
Teratocarcinoma Cells 

DNA could be isolated from EC-A1 cells 2 days after 
infection with M-MuLV and also from EC-A1 (Mo) 
clones. SC-1 cells were transfected with these DNAs 
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Figure 5. Blot Hybridization of Newly Integrated M-MuLV DNA in 
Undifferentiated EC-A1 Celts 

DNA was isolated from EC-A1 ceils on the day 4 (lanes a, b and c), 
day 10 (lanes d, e and f) or day I5(lanes g,.h and i) after Infection 
with M-MuLV, or from uninfected EC-A1 cells (lanes j, k and I). (Lanes 
a, d, g and j) digestion with Bam HI; (lanes b, e, h and k) digest! pn-^-" 
with Bam HI plus Hap II; (lanes c, f, i and 0 digestion with Bam HI plus 
Msp I. * 



) 

(Table 3). Although at a low efficiency, DNA from EO 
A1 cells infected with M-MuLV 2 days previously was 
capable of producing M-MuLV f while DNA from EC- 
A1(Mo) clones was not. 

Discussion 

Jaenisch and. coworkers have demonstrated that en- 
dogenous virus can be formed by infection of the 
preimplantation stage embryo ceils, with exogenous 
virus (Jaenisch et al. ( 1 975; Jaenisch, 1 976). Infection 
of undifferentiated stem cells of mouse teratocarci- 
noma leads to silencing of the exogenously acquired 
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Figure 6. Blot Hybridization of M-MuLV Genome in Undifferentiated 
and Differentiated EC-A1(Mo)4 Cells 

DNA was isolated from undifferentiated (lanes a, b and c) and differ- 
entiated EC-A1(Mo)4 cells (lanes d, e and f). The probe used was a 
^P-labeled 1 .65 kb Bam Hl-Hind III fragment of M-MuLV DNA. (Lanes 
a and d) digestion with Bam HI; (lanes b and e) digestion with Bam HI 
plus Hap II; (lanes c and f) digestion with Bam HI plus Msp I. 



Table 3. Transfection of SC-1 Cells with DNA from M-MuLY- . . 
Infected Cells 





DNA per 






Dish 


Virus-Positive/ 


DNA Source 




Total 


SC-1 


20 


0/20 


SC-1 M-MuLAMnfected 


20 


23/30 


EC-A1 


• 20 


0/20 


EC-A1(MoH 


20 


0/20 


EC-A1(Mo)22 


.20- 


0/20 


EC-A1(Mo)26 


20 


0/20 . 


EC-A1 M-MuLV-infected' 


20 


9/20 


* DNA was isolated 2 days afl 


ter infection with M-MuLV. 
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type C virus genome (Teich et a!., 1977; Speers et 
al., 1980). However, the mechanism of repression of 
integrated viral genome in the cells of the preimplan- 
tation stage embryo and in teratocarcinoma cells has 
remained unknown. 

Undifferentiated teratocarcinoma cells have char- 
acteristics of preimplantation stage embryo cells in 
that they lack H-2 antigen on the cell .surface (Artzt 
and Jacob, 1974), possess two active X chromo- 
somes in female cells (Martin et aL, 1978; McBurney 
and Strutt, 1980) and have the potential to differen- 
tiate into a variety of cell types (Kleinsm.ith.and Pierce, 
1964; Mintz and lllmensee, 1975). Thus teratocarci- 
noma cells are a pertinent model for studies on the 
undifferentiated state of embryogenesis. 

For cells to be in an undifferentiated state, they 
must be equipped with the potential to suppress spe- 
cifically expression of genes required only for differ- 
entiated cells. Mouse teratocarcinoma cells are re- 
sistant to exogenously incorporated genetic elements 
such as SV40 and polyoma virus (Swartzendruber 
and Lehman, 1975; Swartzendruber etai., 1977; Se- 
gal and Khoury, 1979) as well as to murine leukemia 
virus. These viral genomes may be regarded as un- 
necessary luxury genes in undifferentiated teratocar- 
cinoma cells. 

We found that the unintegrated M-MuLV genome 
and at least part of the newly integrated genome, the 
3 kb Bam HI fragment, in undifferentiated teratocar- 
cinoma cells are devoid of DNA methylation, yet the 
expression of the genome is repressed. SV40 DNA 
was also shown to be undermethylated in teratocar- 
cinoma cells (Friedrich and .Lehman, 1 981 ). Treatment 
with retinoic acid, which triggers differentiation of the 
cells, can activate virus expression in undifferentiated 
cells, provided thardrugtreatmeTTrto11ows'shortly~after - 
infection with M-MuLV, when methylation of the M- 
MuLV genome has not yet occurred (Table 1). How- 
ever, the frequency of virus expression induced by 
retinoic acid was rather low, and for the entire course 
of differentiation about 10 days were required. The 
low frequency thus observed might be due to the 
incompleteness of differentiation. The finding that the 
freshly integrated M-MuLV genome in undifferentiated 
teratocarcinoma cells is transfectionally active on SC- 
1 cells (Table 3) suggests that this genome could be 
transcriptionally active in differentiated cells. There- 
fore, the M-MuLV genome freshly acquired by EC-A1 
cells seems to be devoid of DNA modification, and 
DNA modification is known to reduce the rate of 
transcription (Stuhlmann et al., 1981; Hoffmann et al., 
1982). Actually, the M-MuLV genome was devoid of 
methylation at Hpa II sites for at least 10 days after 
infection; it was not expressed in the undifferentiated 
cells. The 3 kb fragment of M-MuLV genome is sub- 
sequently methylated in infected cells kept in culture 
for over 1 5 days. The virus genome becomes trans- 
fectionally inactive and cannot be induced by simple 



differentiation of the cells. Also, suppression of DNA 
methylation by 5-AzaCyd did not activate expression 
of the virus genome in EC-A1(Mo) clones. Dot blot 
hybridization analysis of RNA isolated from 5-AzaCyd- 
treated EC-A1 (Mo)4 cells that carry the methylated 
M-MuLV genome did not contain RNA hybridizable to 
M-MuLV probe, while in the BrdUrd-treated culture 
there was a marked increase of M-MuLV transcript 
(O. Niwa, unpublished observation). Therefore, we 
conclude that the M-MuLV genome is repressed in the 
undifferentiated cells by a mechanism other than DNA 
methylation. This repression can be unblocked by the 
treatment of the cells with BrdUrd. EC-A1 cells freshly 
infected with M-MuLV, and carrying the unmethylated 
viral genome, and EC-A1(Mo) clones presumably 
carrying the methylated M-MuLV genome were both 
induced by BrdUrd treatment (see Table 1 and above). 
Incorporation of BrdUrd into DNA does not affect the 
level of methylation (unpublished observations). 

DNA methylation has no effect on the transcriptional 
activity of the M-MuLV genome in undifferentiated 
cells. The M-MuLV genome is nevertheless methyl- 
ated in cells that have undergone multiple cell cycling. 
Genes that are not transcribed may be preferentially 
methylated in undifferentiated cells. 

The pattern of DNA methylation of the 3 kb Bam HI 
fragment did not change during differentiation of EC- 
A1(Mo)4 cells. However, the M-MuLV genome is now 
susceptible to induction by 5-AzaCyd. Repression of 
the viral genome by DNA methylation thus seems 
operative only in differentiated cells. DNA methylation 
is thought to suppress gene expression through a 
condensation of chromatin, and antisera raised 
against methylcytosine bind to the heterochromatic 
region of the mouse chromosome (Miller et al., 1 974). 
Methylcytosine is more abundant in the fraction of 
chromation that is resistant to DNAase digestion (Ra- 
zin and Ceder, 1977). Methylated sequences of en- 
dogenous viruses are located on chromatin regions 
that are resistant to DNAase I (van der Putten et al., 
1982). 1 

- Cells of the preimplantation stage embryo have two 
active X chromosomes and lack heterochromatin (Ep- 
stein et al., 1978). Heterochromatinization of an X 
chromosome occurs during differentiation of terato- 
carcinoma cells (Martin et al., 1978; McBurney and 
Strutt, 1980). Condensation of the heterochromatic 
region may even be facilitated by the presence of yet 
unidentified chromatin protein(s), and differentiation 
of cells may trigger production of this chromatin pro- 
tein, which inactivates already methylated luxury 
genes by condensation of their chromatin regions. 

In light of all these data, we propose the presence 
of two independent mechanisms regulating gene 
expression In mammalian cells. The first mechanism 
which is operating in undifferentiated cells, is not 
influenced by the state of DNA methylation for its 
function! Repression of transcription by this mecha- 
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nism may involve discrimination against the luxury 
gene promoter. Host range mutants of polyoma virus 
that can replicate on undifferentiated teratocarcinoma 
cells were found to possess mutations at the promoter 
region (Sekikawa and Levine, 1981). The second 
mechanism suppresses the expression of methylated 
genes by changing the conformation of their chroma- 
tin domains, and this mechanism operates only in 
differentiated cells. When M-MuLV infects undiffer- 
entiated teratocarcinoma cells, the viral genome is 
suppressed by the first mechanism. The transcription- 
ally inactive genome of M-MuLV may be gradually 
methylated during replication of the host cells. Differ- 

. entiatlbn of the.cells terminates regulation by. the first 
mechanism and activates the second mechanism, 
which now recognizes the methylated genome of M- 
MuLV and represses its expression by condensation 
of the chromatin domain. 5-AzaCyd unlocks the sec- 
ond mechanism by decreasing the level of methylcy- 
tosine, while BrdUrd unblocks both mechanisms. 
Since BrdUrd is known to change the affinity of DNA- 
binding proteins for DNA (Lin and Riggs, 1972, 1976; 
Goeddel et al., 1977), binding of some chromatin 
protein(s) to the promoter region of M-MuLV may be 
respbhsibleTbr operation of the first mechanism. The 
first mechanism of the repression of gene expression 
seems to be frans-acting, since the extrachromosomai 
genome of SV40 and the M-MuLV genome integrated 
at random sites in undifferentiated cells are repressed. 
These findings suggest that the first mechanism may 
inactivate gene expression, possibly by some diffusi- 
ble repressor-like protein, and if such is the case, the 
frans-acting nature can be readily explained. 

During preparation of this manuscript, a report ap- 
peared on a similar subject (Stewart et al., 1982) 
These authors' data suggest that the M-MuLV genome 
becomes methylated immediately after integration. 
This difference between our results and theirs might 

. relate to different cell lines. We used PCC4-derived 
EC-A1 cells, while they used F-9 cells. Gautsch and 
Wilson (1983) apparently obtained findings similar to 
those we report here. 

Experimental Procedures 
Cell Lines and Viruses 

EC-A cells (Gautsch. 1980), a subline of PCC4 Aza1 cells (Jakob et 
al., 1973), were kindly provided by J. Gautsch. EC-A1 cells, a 
subclone of EC-A cells isolated in our laboratory, were also used. F- 
9 cells .(Strickland and Mahdavi, 1978) were obtained from KL Sekl- 
gawa. SC-1 cells, a mouse embryo fibroblast line derived from a feral 
mouse (Hartley and Rowe, 1975), were obtained from A. Decleve. 
Teratocarcinoma cell lines were grown In Eagle's minimal essential 
medium supplemented with 10% heat-Inactivated fetal calf serum. 
SC-1 cells were grown In MEM alpha medium (Gibco) supplemented 
with 10% heat-inactivated calf serum. 

M-MuLV was obtained from A. Ishimoto and grown on SC-1 cells. 

Virus Infection, Transfection and Biological Assays 
Procedures of virus infection were as described previously (Niwa-et— " 
al., 1973). Titration of M-MuLV stock was on SC-1 cells by UV-XC 
assay (Rowe et al., 1970). M-MuLV-ihfected teratocarcinoma cells 



were assayed directly or after cocultivation with SC-1 cells by the 
reverse XC cell assay (Niwa et al., 1 973). 

Transfection of SC-1 cells with DN A from M-MuLV-infected EC-A1 
cells was carried out by a procedure described by other workers 
{Stuhlmannetal., 19B1). 

Drug Treatments 

Stock solutions of retinoid and BrdUrd were prepared in dimethyl- 
sulfoxlde at concentrations of 1 0 mM and 1 mg/ml, respectively, and 
kept in the dark at -20 C C. Teratocarcinoma cells were incubated 
with these drugs for 24 hr at 37°C either before or after infection with 
M-MuLV. When cocultivation with SC-1 cells was used for the ampB- 
fication of progeny virus, teratocarcinoma cells were treated for 30 
min with 25 ptg/ml mitomycin C to suppress .overgrowth of the 
undifferentiated cells. 

Differentiated teratocarcinoma ceils were obtained by the proce- 
.dutt_oLSpeers_et_al.^.9.8.0)^itb^ 

teratocarcinoma cells grown as monolayer cultures were nourished 
every 3 days with a medium containing 1 0 mM dlmethylacetamide for 
2 weeks. Cultures consisted only of ceils with an epithelial morphol- 
ogy. Although retinoid acid also induced differentiation of EC-A1 
cells, it was more cytotoxic than dimethylacetamjde. 

Base Analysis 

The level of methylcytosine in the newly synthesized QNA was as- 
sayed as described previously (Niwa and Sugahara, 1981). 

Extraction of Cellular RNA and DNA 

Total cellular RNA was isolated by sedimentation through, cesium 
chloride as described by others (Chirgwin et al., 1 979). 

For the isolation of DNA, dishes were washed with phosphate- 
buffered saline solution and digested at 37°C for 2 hr with 1 00 pg/ml 
RNAase A In 1% sodium dodecyisulfate, 0.1 M NaCt, 5 mM EDTA 
and 20 jnM Tris-HCl (pH 8.0). Proteinase K was then added at 100 
jig/ml. and dishes were incubated tor another 2 hr at 37°C. The 
lysate was extracted three times with phenol-chloroform and ethanol- 
preclpitated. DNA thus extracted was used for further analysis. Un- 
integrated viral DNA was isolated by the procedure of Hirt (1967). 

Restriction Endonuclease Digestion, Gel Electrophoresis and 
Hybridization 

Restriction enzymes were obtained from Takara Shuzo Co., Ltd. 
(Kyoto, Japan) except for Msp I, which was purchased from New 
England Biolaboratory. After cleavage with restriction enzymes, DNA 
samples were eiectrophoresed on 0.7% horizontal agarose gels and 
transferred to a nitrocellulose sheet, as described by Southern 
(1975). 

Total cellular RNA was denatured, as described (McMaster and 
Carmichael, 1977), eiectrophoresed on 1.1% agarose gels and trans- 
ferred to DBM paper accordingly (Atwine et al., 1 977). For the dot 
blot hybridization, nondenatured RNA was spotted directly onto nitro- 
cellulose filter CThomas, 1 980). 

The recombinant plasmid which carries the 8.2 kb M-MuLV ge- 
nome at the Hind HI site and which was cloned by J. W. Hoffmann, 
was a generous gift from R. A. Weinberg. The DNA fragment of 8.2 
kb containing the entire sequence of M-MuLV was purified by agarose 
gel electrophoresis. M-MuLV DNA was further digested by Bam HI 
and Sma I. The Hind ID-Bam HI fragment of 1 .65 kb was recovered 
from agarose gel after electrophoresis. The Sma I fragment of 31 6 
bases was eiectrophoresed through a poiyacrylamide gel. Recombi- 
nant plasmlds carrying 18S and 28S rRNA gene of the mouse were 
a kind gift from R. Komlnami (Urano et at., 1980; Komlnami et al., 
1982). These DNAs were labeled by the nick translation procedure 
(Maniatis et al,. 1975). Specific activities of the probes were 1-4 x 
10*cpm//ig. 
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Abstract 

Retroviruses are known to integrate in the host cell genome as proviruses, and therefore they are prone to cell-mediated control at the 
transcriptional and posttranscriptional levels. This plays an important role especially after retrovirus heterotransmission to foreign species, 
but also to differentiated cells. In addition to host cell-mediated blocks in provirus expression, also so far undefined host specificities, 
deciding upon the pathogenic manifestation of retrovirus heterotransmission, are in play. In this respect, we discuss especially the occurrence 
of wasting disease and immunodeficiency syndrome, which we established also in avian species using avian leukosis virus subgroup C 
(ALV-C) inoculated in mid-embryogenesis in duck or chicken embryos. The problem of provirus downregulation in foreign species or in 
differentiated cells has been in the recent years approached experimentally. From a series of observations it became apparent that provirus 
downregulation is mediated by its methylation, especially in the region of proviral enhancer-promoter located in long terminal repeats (LTR). 
Several strategies have been devised in order to protect the provirus from methylation using LTR modification and/or introducing in the LTR 
sequence motifs acting as ami methylation lags. In such a way the expression of retroviruses and vectors in foreign species, as well as in 
differentiated cells, has been significantly improved. The complexity of the mechanisms involved in provirus downregulation and further 
possibilities to modulate it are discussed. © 2000 Elsevier Science B.V. All rights reserved. 

Keywords: Retrovirus heterotransmission; Retroviral vector: Methylation: CpG: Long terminal repeat(s); Histone deacetylase 



1. Introduction 

The topic of this review is focused on selected aspects of 
retrovirus heterotransmission, but also touches the problem 
of retroviral genome silencing in differentiated and other 
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types of cells. In case of heterotransmission, many host 
cell factors are in play, which decide whether or to what 
degree the retroviral genome will be expressed and what 
pathogenic consequences may be triggered. As is known 
generally, the retroviral genome becomes integrated in the 
cell genome as a provirus and, therefore, it is not surprising 
that it is highly influenced by the host-cell gene-regulation 
machinery. In discussing such downregulation of the 
provirus we are in fact dealing with post-integration blocks 
in provirus expression, the nature of which is epigenetic and 
mediated by new host cell factors. The unusual cell milieu, 
in concert with which the virus has not been evolving, can 
be lacking some factors like those enabling viral RNA 
export from the nucleus, or provide unusual factors like 
those changing the viral RNA splicing. However, of main 
importance is cell transcriptional regulation, which in many 
cases leads to provirus silencing. It is, therefore, not surpris- 
ing that in phylogenetically distant host cells the provirus 
can integrate, but in many cases does not produce an infec- 
tious progeny. We call such host cells non-permissive, in 
contrast to permissive cells where formation of infectious 
virions takes place. 
Non-pemiissivenes to retroviral infection has been for a 
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long time ascribed to foreign species host cells. Interestingly 
enough, non-permissiveness to retroviral infection marks 
also some cells of the virus species origin. This has been 
recognized especially in cases of cultured differentiated 
cells. In such a way, surmounting hurdles negatively influ- 
encing provirus expression became of importance for an 
efficient application of retroviral vectors for gene therapy 
of developmental^ committed stem cells and differentiated 
cells. Therefore, we also discuss the provirus fate in some 
defined cases of differentiated cells that originate from the 
same species as the retrovirus. There are common features 
shared among all these situations, pointing to a role of cell- 
mediated gene silencing as an important factor that super- 
imposes upon provirus expression. 



2. Trans-class retrovirus heterotransmission 

This extreme situation was first achieved in the case of 
chicken Rous sarcoma virus (RSV) strains transmitted first 
to rodents and later also to other mammalian species, 
including monkeys (rev. Svoboda, 1986). RSV genome 
expression in mammalian cells is governed by a series of 
factors, especially by flanking DNA sequences, their rich- 
ness in GC (Fincham and Wyke, 1991; Rynditch et ah, 
1991), but also posttranscriptional steps are involved, 
which are blocked in mammalian cells (rev. Svoboda, 
1998). More recent progress indicates that of importance 
are RSV LTR, which are prone to methylation in mamma- 
lian cells. 

It was shown experimentally that a reporter gene driven 
by in vitro methylated RSV LTR is more efficiently 
suppressed in mammalian cells as compared to chicken 
cells (Hejnar et al., 1999). In spite of the fact thai there 



are 16 CpGs in the Prague strain of Rous sarcoma virus 
(PR RSV) LTR and multiple CpGs are present in leader 
sequences, methylation of one CCGG HpaU site, located 
downstream of the promoter region but close to the single 
provirus transcriptional start, was sufficient to produce 
dramatic reporter downregulation in mammalian cell lines. 
It should be noted that these experiments were done using 
transient transfection of methylated proviral DNAs and 
unmethylated controls. Expression of unmethylated LTR 
was comparable in both avian and non-permissive mamma- 
lian cells, suggesting that both types of cells harbor suffi- 
cient transcriptional machinery required by RSV LTR. 
Silencing of RSV proviruses is therefore a post-integration 
event. 

The significance of LTR methylation for provirus down- 
regulation has been recently approached using two strate- 
gies. The first one, schematically shown in Fig. 1, implies 
insertion of four canonical Spl binding sites in the RSV 
LTR enhancer region using EcoKl sites. Such reconstruc- 
tion has been documented and discussed in detail (Machon 
et al., 1998), and it was found that Spl insertion signifi- 
cantly increases LTR-driven chloramphenicol acetyltrans- 
ierase (CAT) reporter gene expression, especially in 
hamster cells using both transient and stable transfection 
assays. This is in agreement with findings revealing that 
Spl binding sites represent the critical part of sequences 
acting as antimethylation tags, as was documented clearly 
in the case of a CpG island containing 1.7 kb DNA located 
in front of the adenosine-phosphoribosyllransferase (aprt) 
gene (Mummaneni et al., 1993; Brandeis et al., 1994; 
Macleod el al., 1994). Acquisition of an Spl binding site 
by mutation in the LTR enhancer region was shown 
previously to activate MLV (murine leukemia virus) tran- 
scription in embryonic carcinoma (EC) cells (Prince and 



CGGGCCGGGCCGGAACTGGGCGGAGTTAGGCATG 
GTACGCCCCCCCCCG CCTTG ACCCGCCT CAATCC 




AATTCGGGGCGGGGCGGAACTGGGCGGAGTTAGG 

GCCCCGCCCCG CCTTGACCCGCCT CAATCCTTAA 
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Fig. I. Scheme of the CAT reporter vector employed for insertion of Spl sites into LTR sequence and transfeciion experiments. Sites of insertion (5/;//l and 
EcoKl) are denoted by arrows. Spl binding sites within the inserted sequences arc in bold. 
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Rigby, 1991). In a similar way, transcriptional activation of 
human endogenous retrovirus (HERV-H) was accomplished 
by Spl occurrence in either LTR enhancer (Nelson et al., 

1996) or promoter (Sj0ttem et al., 1996; Anderssen et al., 

1997) . The second approach is based on the incorporation of 
the mouse aprt gene CpG island immediately upstream to 
the RSV LTR-driven fused |3-galactosidase and neomycin 
resistance i$-geo) proviral reporter (see Fig. 2). After trans- 
fection of hamster cells and selection for neomycin resis- 
tance, such construction ensured stable transcriptional 
activity in a reasonable number of cell clones, whereas 
unprotected proviral reporters are inactivated. This effect 
can be attributed to the antimethylation protection of the 
CpG island for two reasons; first, the active proviruses 
remained unmethylated within the 5' LTR as evidenced 
by the bisulphite sequencing technique, second, the tran- 
scription efficiency of the RSV LTR is not increased in 
the presence of the CpG island. 

Thus, both approaches led to the conclusion that proviral 
LTR could be protected from DNA methylation in the 
foreign species host. Optimization of these protective stra- 
tegies might open the way to construct improved RSV- 
based vectors for gene transfer, more suitable for expression 
in mammalian ceils and without the risk of infectious retro- 
viral progeny. Retroviral vector producer cells (helper cells) 
are an important object for such protection as well. These 
cells have been shown genetically unstable due to the 
methylation of integrated helper proviral constructions. 
Designing a helper virus to overcome cellular DNA methy- 
lation may therefore improve vector production (Young et 
al., 2000). 

There exists also another way of trans-species retrovirus 
transmission utilizing xenotropic murine leukemia viruses 
(X-MLV) (rev. Levy, 1978). It is interesting that these 
viruses can replicate in some avian cells such as duck 
cells, but not in others like chicken cells (Levy. 1977). 
X-MLV can provide envelope components to pseudotype 
RSV virions and such pseudotypes transform and replicate 
in duck but not in chicken cells. In vivo inoculation of 



X-MLV or murine sarcoma virus (MSV) pseudotyped by 
X-MLV was performed in duck embryos or newborn duck- 
lings (Levy et al., 1982). Evidence of virus persistence has 
been obtained, but not convincing data concerning their 
pathogenesis. In mammalian cells, X-MLV-pseudotyped 
RSV can also replicate in the presence of X-MLV first to 
a low titer, which increases with passages. Furthermore, an 
envelope component of ALV-C phenotypically mixed with 
X-MLV was detected after passaging in mammalian cells. 

It is not known how X-MLV contributes to RSV replica- 
tion in mammalian cells. Obviously, it provides at least a 
part of the Env glycoprotein required for penetration to 
mammalian cells. However, additional factors complement- 
ing the non-permissiveness of mammalian cells to RSV 
should be in play. Due to its ability to multiply in such 
cells, X-MLV can increase and facilitate some posttran- 
scriptional steps such as viral RNA proper splicing and 
export from the nucleus and/or further steps involving clea- 
vage of viral protein precursors and virion assembly. These 
facilitating effects of X-MLV should be synchronized with 
RSV infection, because superinfection of an already RSV- 
transformed mammalian celL containing functional 
proviruses rescuable by fusion with chicken fibroblasts, 
does not result in virus production (Levy, 1977). 

Because X-MLV contribution to RSV replication in 
mammalian cells increases with passages, a possibility of 
genetic exchange between these viruses should be taken into 
account. In every case, these questions should be reinvesti- 
gated using presently available efficient tools of molecular 
biology. 

3. Retrovirus heterotransmission among species within 
the same class 

One of the first thoroughly analyzed successful hetero- 
transmissions was achieved by Duran-Reynals (1942), who 
showed that RSV produces both early and late appearing 
tumors when inoculated in young ducks. Since then, many 
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Fig. 2. Cloning of the aprt gene CpG island and RSV-based reporter proviruses. Filled boxes represent exons of the aprt gene. Vertical arrows denote the 
position of three Spl sites in the CpG island. Filled arrowheads denote transcription starts of the aprt gene and o' LTR. 
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Trans-species retrovirus transmission 



Virus 
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Original lithogenicity 
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Wasting disease, 
immunodeficiency 
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Mice 


Erytbroleukemia 


Rat 


Bone marrow suppression, 
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Mazgareanu et al.. 1998. 


ALV-C 


Chicken 


Anemia, probably 
immunodeficiency 


Duck 


Wasting disease, 
immunodeficiency, anemia 


Karakozet al.. 1980; 


ALV-B 




Anemia 






Smith and Schmidt. 1982. 
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other retroviruses, including these of mammalian origin, 
were experimentally transmitted among dilferent mamma- 
lian species. These transmissions were monitored mainly by 
virus oncogenic activity and virus persistence. Because of 
generally low retrovirus replication in foreign species, addi- 
tional pathogenic virus activity, such as immunosuppres- 
sion, usually do not appear. In many cases hetero- 
transmission produced the same symptoms as in the species 
of the virus origin, but there are well-documented cases of 
changed virus pathogenicity. Further, we shall deal mainly 
with the symptoms of wasting disease accompanied by 
immunodeficiency resulting in increased susceptibility to 
various infectious agents. 

The simian immunodeficiency viruses (SIV) highlight 
this situation. Generally, in the monkey species, in which 
SIV is indigenous, it replicates efficiently but does not 
produce any pathogenic changes. However, when trans- 
mitted to some other monkey species, SIV produces the 
immunodeficiency syndrome. Of special interest is SIV 
heterotransmission to humans. As is summarized in Table 
I, good evidence based on molecular biology and epide- 
miology has been provided documenting that both human 
immunodeficiency virus 1 (HIV-1) and HIV-2 represent a 
consequence of respective SIV sm (sooty mangabey SIV) and 
SIV cpz (chimpanzee SIV) heterotransmission to humans. 
There is no doubt about the significance of this finding, 
which should be taken as a warning against potential danger 
of retrovirus transgression of species barriers associated 
with fulminating pathogenic changes. In order to understand 
diese events, comparative data obtained with other members 
of the retrovirus family should be evaluated. 

Interesting observations were made in the case of bovine 
leukemia virus (BLV) (Table 1). This vims responsible for 
cattle leukosis when transferred to newborn rabbits triggers 
clear symptoms of immunodeficiency. Because such a 
response was not found in other infected species, these 
observations indicate that rabbits respond to BLV inher- 
ently, in an unusual way. 



In murine leukemia viruses, variants capable to produce 
immunodeficiency preferably have been isolated. This 
includes both Moloney MLV (Saha et al., 1994) and Friend 
leukemia complex (Faxvaag et al., 1993). In addition, as 
given in Table 1, Friend MLV transmission to newborn 
rats resulted in altered pathogenicity characterized by 
suppression of bone marrow cells, manifesting itself as 
reduced numbers of Thy 1 + cells. 

Avian leukosis viruses have not been thoroughly investi- 
gated from the point of view of their immunopathogenicity 
in foreign avian species. As given in Table 1, ALV subgroup 
C were studied using intraembryonic inoculation both in 
chicken and ducks. According to data obtained so far, this 
subgroup produces symptoms of anemia in both species. 
However, in heterologous duck hosts, a fatal wasting 
disease together with conspicuous atrophy of the thymus 
tissue starting the first week after hatching was observed 
both by a decrease in the relative thymus to body weight, 
histologically characterized by clear depletion of the thymus 
cortical layer (Fig. 3) (Stepanets et ah, 2000). Microscopi- 
cally, bursa Fabricii, which constitutes a special B-cell 
producing organ in birds, also displayed the cortical layer 
depletion. In agreement with this observation, production of 
humoral antibodies against Brucella abortus antigens was 
significantly decreased in infected animals, which confirms 
that immunodeficiency is involved. The nature of this 
immunodeficiency is being investigated and the character 
of specific T- and other cell alteration in thymus is moni- 
tored by specific antibodies. In addition, changes in 
lymphoid organs in young chickens intraembryonally 
inoculated with different ALV subgroups should be inves- 
tigated. According to our preliminary data, ALV-C does 
produce symptoms of thymus involution even in chickens. 
Thus ALV-C provides a suitable comparative system for 
establishing ways leading to the immunodeficiency 
syndrome both in homologous and heterologous hosts. 

There is not a simple answer to the question why retro- 
viruses in some species behave pathogenically or produce 
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Fig. 3. Thymus sections of 7-day-old ducklings. (A) Thymus of a control animal, mock-injected with tissue culture medium. (B) Thymus of an animal infected 
in mid-em bryogenesis with 1 0 5 ALV-C infectious virus. Thymic conical layer is visibly depleted. C. cortex: M. medulla: arrowheads mark the interim between 
cortex and medulla. Stained with UematoxyHn-eosin. Magnification 41 x . 



new or accentuated pathogenic symptoms. In some cases, 
such as Friend MLV, selection of virus mutants might be 
involved, but this seems not to be a general situation, 
because altered pathogenicity appears very soon after 
virus inoculation and is not correlated with certain specific 
virus gene alteration. Therefore other, especially host-speci- 
fic factors should be taken into account. It is not known so 
far which of them play a decisive role. Retrovirus toxicity is 
in the first step related to the outcome of the viral envelope 
(product of the env gene) interaction with target cells, 
immune and bone- marrow cells included. Therefore, if an 
infected cell harbors unusually expressed receptors and co- 
receptors as well as factors required for further steps of virus 
penetration, they may undergo deterioration or apoptosis. 
As was discussed in relation to HIV (Fauci, 1996), the 
degree of cytokine activity and the outcome of interrelation 
among different cytokine pathways contribute to virus 
expression and its pathogenicity. 

[n spite of the fact that heterotransmission of SIV took 
place probably several times, we are lacking the exact infor- 
mation about this process. In contrast to the experimental 
systems, natural conditions are probably different, because 



only low amounts of virus or better virus-infected cells from 
a donor might have mediated heterotransmission. Such a 
situation has not been modeled in detail, but it can be 
predicted that under such conditions the virus becomes effi- 
ciently expressed only rarely. Therefore, there might be a 
bridge connecting trans-class and trans-species retrovirus 
heterotransmission. 

Some experimental data obtained from long-term follow- 
up of retrovirus persistence (Trejbalova et al., 1999) suggest 
that provirus silencing discussed in the first and also next 
chapter might play a role in the control of low-dose infec- 
tion and long-term retrovirus persistence. 



4. Provirus silencing and expression in differentiated 
cells 

In order to investigate the role of cell differentiation in 
relation to retrovirus infection, several models have been 
designed, but most thoroughly were analyzed EC cells, 
which differentiate in vitro. The entree to this problem 
was given by the group of Jaenisch (Stewart et aL 1982). 
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who discovered that MLV integrated in EC cells becomes 
methylated and unexpressed. In a series of papers from 
different laboratories (reviewed by Challita et al., 1995) it 
was established that in order to ensure MLV expression in 
EC, introduction of a transcription factor Spl binding site 
into LTR is required together with inactivation of the nega- 
tive control region (NCR) in LTR as well as of the region of 
the primer binding site. As the third negatively acting 
element, one out of two direct repeats in LTR was recog- 
nized (Hawley et al., 1 994). There is still room left for MLV 
improved expression, as exemplified by insertion into LTR 
of an antimethylation fragment from the region upstream of 
the Thyl gene (Challita et al., 1995). 

All the above-mentioned modifications act synergisti- 
cally. Recently, MLV LTR lacking known negative 
elements due to deletion spanning most of the 5 l end LTR 
portion has been constructed (Osborne et aL, 1999). As a 
result of such a deletion, about half of 13 CpG sites within 
LTR were also lost. Such truncated LTR, essentially 
stripped of enhancer elements, was employed for generation 
of a retroviral vector harboring the neomycin resistance 
(neo) reporter gene equipped with internal p-globin promo- 
ter, which after infection of EC cells displayed expression in 
70% of cells, the highest efficiency obtained so far. 

We have not yet reached the end of the journey to optimal 
retroviral vector function in differentiated cells. It is still 
possible that additional, so far undefined sequences should 
be inactivated, altered or inserted. Of special importance 
might be CpG dinucleotides present in LTR, especially at 
the start of transcription. Elements acting as antimethylation 
signals, interfering with silencers or ensuring position-inde- 
pendent gene expression, such as locus control regions 
(LCR), should be tested for their ability to ensure retrovirus 
or retroviral vector expression in differentiated cells. 
Recently, the chicken hypersensitive site 4 (cHS4) of the 
chicken globin LCR, acting as an insulator, when cloned 
into MLV LTR was shown to increase the probability of 
integrated provi ruses expression and to decrease the level of 
de novo methylation of the 5 'LTR in murine erythroleuke- 
mia (MEL) cells (Rivella el al., 2000). In addition, the 
human p interferon scaffold attachment region (JFN- 
SAR), when inserted in retroviral LTR, prevented its methy- 
lation and ensured vector expression in a stably transfected 
line of human T cells. The vector expression has been kept 
for several months and included also multiple proviral 
copies (Agarwal et al, 1998; Dang et al., 2000). We can 
therefore stress the point that the problem of permissiveness 
of differentiated cells to retrovirus infection goes far beyond 
EC cells and that other differentiated cells, such as hemato- 
poietic or hepatic cells, should probably require not only 
prevention of vector downregulation, but even some more 
specific cell changes like activation of steps triggering the 
cell cycle (rev. Emerman, 2000). 

Dealing with provirus silencing in differentiated cells we 
focused on provirus methylation as an epigenetic DNA 
modification described repeatedly in conjunction with 



provirus downregulation. However, it is not clear so far 
whether provirus methylation acts as the primary cause or 
whether it only conserves transcriptional repression. In 
provirus methylation studies, usually a general increase in 
CpG methylation has been measured. However, some more 
precise data have been obtained showing that methylation of 
one particular CpG has a decisive effect. This is the case of 
HIV, where methylation of only one CpG at position - 143 
(in the vicinity of NF-kB and Spl binding sites) in HIV LTR 
results in 70% inhibition of the reporter expression 
(Bednarik et al., 1990; Schulze-Forster et al., 1990). Simi- 
larly, single CpG site methylation in the RSV LTR U5 
region nearby the transcription start leads to a clear decrease 
in provirus expression (Hejnar et aL, 1999). As discussed 
later, the density of CpGs is also of importance. 

It has been recognized on other gene models that methy- 
lation provides a signal for association with methyl-CpG- 
binding protein 2 (MeCP2), which, through the adaptor 
protein Sin3A, recruits the histone deacetylase (HDAC) 
(rev. Razin, 1998; Ng and Bird, 1999; Knoepflerand Eisen- 
man, 1999). A similar situation was disclosed in the case of 
MeCPI, which produces a complex composed of the MBD 
protein containing the methyl-CpG-binding domain and two 
members of the HDAC family (rev. Bird and Wolffe, 1999). 
Thus, there is a proven link between CpG methylation and 
chromatin deacetylation. 

How are these findings related to provirus silencing? It 
should be noted that MeCPI has been already shown to bind 
to methylated LTR of myeloproliferative sarcoma virus, 
suppressing its LTR activity (Boyes and Bird, 1991). This 
problem has been recently approached by Lorincz et al. 
(2000). They employed the MLV LTR-driven green fluor- 
escence protein (GFP) gene and followed GFP expression 
after infection of MEL cells with this vector. Cell clones that 
displayed silenced GFP were isolated and it was disclosed 
that early after silencing the proviruses became methylated 
to a low degree. Such clones could have been reactivated by 
trichostatin (TSA), which inhibits HDAC-MeCP2 
complexes. During prolonged cell cultivation the vector 
has been increasingly methylated and in the hypermethy- 
lated state provirus expression was induced only by combi- 
nation of TSA and 5-azacytidine (5-azaC), which acts as a 
demethylation agent. These results suggest that provirus 
downregulation is a dynamic process and that the possibility 
of its reactivation depends upon the density of methylation. 

All the data obtained so far point to the important role of 
methylation in provirus silencing in general, and therefore 
strategies preventing methylation of retroviral vectors as 
well as putative blocking of methyl-CpG-binging proteins 
should contribute to more efficient gene therapy applied to 
differentiating or differentiated cells. 

There is, in addition, an important question: whether and 
why retroviruses are more efficiently recognized and 
silenced than any foreign DNA introduced in a genome. 
In other words, does there exist a cell genome surveillance 
mechanism (analogous but not homologous to immunity) 
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lhat ensures down regulation of retroviral sequences? 
Methylation has been already proposed to fulfil such a 
duty (Doerfler, 1991; Yoder et al., 1997). However, some 
signals should be involved that attract methyltransferase to 
an integrated provirus, or even to a specific DNA structure 
common ro viral integration intermediates (Bestor, 1987). 
Such signals might be provided by flanking chromosomal 
sequences or by the pro viral structure itself. Especially LTR 
could be recognized as unusual direct repeats. Similar struc- 
tures in lower eukaryotes trigger gene silencing (rev. Wolff e 
and Matzke, 1999). Furthermore, MLV LTR binding zinc 
finger transcription factor YY-1 (Flanagan el al., 1992) was 
shown to represent, a homolog of Sin3 (Yang et al., 1996), 
which has been already identified as a protein complexing 
with HDAC involved in formation of transcriptionally inac- 
tive chromatins It should be also taken into account lhat in 
some heterologous and differentiated cells, proteins activat- 
ing LTR might be underrepresented or that such cells 
produce altered isomorphic proteins, which could be inac- 
tive and/or could interfere with factors required for LTR 
activation. Therefore, retroviral genomes should be 
screened also from the point of view of sequences and 
DNA-protein complexes which might contribute to provirus 
silencing, of course, in context with flanking chromosomal 
DNA. New techniques such as inverse polymerase chain 
reaction should facilitate this demanding task. 
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Developmental control of gene expression has a major impact on the 
design of P-globin retrovirus vectors for hematopoietic stem cell gene 
therapy of P-thalassemia. It is obvious that the endogenous locus con- 
trol region (LCR) elements that drive P-globin gene expression in 
transgenic mice must be included in these vectors. However, the specific 
elements to use are not clear and require an understanding of LCR 
action. Moreover, retrovirus vectors contain silencer elements that 
function in stem cells and are dominant to LCR function. Recent stud- 
ies on LCRPrglobin transgenes and retrovirus silencing suggest ways to 
overcome this silencing effect after transfer into stem cells and carefully 
designed lentivirus vectors have exciting therapeutic benefit in animal 
models of P-thalassemia. By building on 15 years of development, 
LCRP-globin vectors are now being tested, in preclinical animal models 
and may mtimately lead to the long-sought cure for this genetic dis- 
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p-thalassemia is caused by genetic defects that re- 
duce p-globin protein levels (1). The resulting im- 
balance of P-globin to a-globih chains results in a 
severe anemia that is generally treated by repeated 
blood transfusions. A consequence of such regular 
blood transfusion is the increased risk of exposure 
to virus-infected blood supplies, and eventually 
leads to iron overload that can cause organ failure 
(2). Present iron chelation treatments are contro- 
versial (3), and the only cure available is bone 
marrow transplantation from a matched sibling if 
available. An attractive alternative is to perform 
gene therapy to deliver a human P-globin gene into 
hematopoietic stem cells (HSQ from the patient 
(4, 5). 

For P-globin gene therapy to be successful, it is 
essential that the transferred gene be expressed to 
the correct level. This goal requires a detailed 
knowledge of the mechanism and cw-acting se- 
quences that control P-globin expression during 
development. Surprisingly, inclusion of the appro- 
priate regulatory elements may not be sufficient to 



obtain therapeutic levels during gene therapy as 
the vectors used to deliver the gene are frequently 
silenced in transduced stem cells. Hence, a better 
understanding of the mechanism of vector silenc- 
ing in stem cells is also required. Here, we review 
the regulatory elements that control P-globin gene 
expression during development and their use in 
P-globin retrovirus vectors, outline the evidence 
that retrovirus and lentivirus vectors are silenced in 
stem cells and potential means to overcome this 
silencing, and we conclude with preclinical anrmal 
models to test promising p-globin gene therapy 
vectors. 

p-globin gene expression during development 

The P-globin gene is part of a cluster of highly 
related globin genes located on Chr llplS in hu- 
mans (6). These genes are arranged in the same 
order as they are expressed during development 
(Fig. 1). The e-globin gene is expressed in the 
j>lood islands of the yolk sac, the" site of hemata- 
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poiesis then switches to the fetal liver where the 
y-globin genes are expressed, and shortly after 
birth hematopoiesis switches to the bone marrow 
where the 5- and P-globin genes are expressed to 
very low and high levels, respectively. Many muta- 
tions that cause ^-thalassemia have been described 
and some have been informative with regard to Hie 
sequences and molecular mechanisms that control 
globin gene switching. For example, it is clear that 
point mutations in the y-globin promoters can 
enhance expression of these genes in adults causing 
hereditary persistence of fetal hemoglobin 
(HPFH). Deletion of upstream sequences that in- 
clude the locus control region (LCR) results in a 
lack of expression from the still intact globin genes. 
These data demonstrate that promoter sequences 
and the LCR are important control elements for 
globin expression. 

The p-globin locus control region 

The LCR is composed of at least four DNasel 
hypersensitive sites (HS) located upstream of the 
locus (Fig. 1) (7-9). The presence of HS indicates 
that transacting factors are binding to these re- 
gions and displacing or destabilizing nucleosomes. 
Nucleosomes are the basic units of chromatin and 
condense DNA around an octomer of the histone 
proteins H2A, H2B, H3 and H4 (10). Expressed 
genes are located an 'open 9 chromatin that is more 
accessible to trans-acting factors and in general 
contain nucleosomes with highly acetylated his- 
tones. In contrast, 'closed' chromatin generally has 
deacetylated histones and is bound by the linker 
histone HI, is less accessible to DNA binding 
factors and genes in these regions are not ex- 
pressed. Chromatin structure is modulated using 
many chromatin remodelling complexes (11, 12). 



Embryonic Fetal Adult 
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Fig. 1. Gene structure and expression pattern of the human 
P-globin locus. The locus control region (LCR) is composed of 
at least 4 DNasel hypersensitive sites (HS) located upstream of 
the cluster. Traw-acling factors (red circles) bind to the HS. In 
transgenic mice, LCR activity directs copy number-dependent, 
position-independent transgene expression. Full levels of p- 
globin transgene expression are obtained in the presence of all 
four HS including their core elements (red boxes). 



The mechanism by which the LCR controls P- 
globin gene expression has been extensively stud- 
ied, primarily using transgenic and knockout mice 
(6). It is important to realize that these two assays 
manipulate the genes in quite different ways and 
the results are not always complementary or in 
agreement with each other. Transgenic mice con- 
tain the human P-globin gene transferred into 
novel or ectopic integration sites; whereas, knock- 
out mice manipulate the endogenous native locus 
in the mouse. In transgenic mice, human P-globin 
transgenes are silent at most integration sites or 
transcribed to about 1% of the endogenous mouse 
Pmajor level. In contrast, addition of the 
including all four HS to the P-globin transgene 
results in expression to about 100% ievels at all 
integration sites, and expression is copy number- 
dependent (Fig. 1) (7, 13, 14). This copy number- 
dependent, position-independent transgene expres- 
sion is unusual and is the defining feature of LCR 
activity. Further investigation demonstrated that 
individual HS2, HS3 and HS4 elements and their 
smaller 'cores' of approximately 200-300 bp, also 
direct copy number-dependent transgene expres- 
sion but to lower levels (10-25%) (15-20). 

The LCR is often referred to as an enhancer, but 
does not have classic enhancer activity because it 
does not function equally well in either orientation 
(21). Rather, it appears that complete LCR activity 
requires all four HS (22), and these have some 
distinct roles. For example, HS3 can activate P- 
globin transgenes at all single copy integration sites 
where it establishes open chromatin and remodels 
chromatin on the promoter to permit expression 
(13). In contrast, although HS2 has strong en- 
hancer activity in transient transfection studies 
(23), it is unable to direct expression in single copy 
. --transgenic -mice (24).- These data suggest that-at 
ectopic sites, the HS function together as a unit, 
making the LCR sufficient to open chromatin and 
enhance full expression of p-globin transgenes. 

Open chromatin is likely to be established by the 
binding of erythroid transacting factors that re- 
cruit chromatin remodelling complexes (25-31), as 
has been described for histone acetylation changes 
on active human P-globin genes (32). This open 
chromatin may not extend throughout transgenes 
containing the entire human LCRP-globin cluster, 
as different domains that correlate with the pres- 
ence of low level intergenic transcription have been 
described during globin switching in mice (33). 

Two models of LCR activity 

The transgenic mouse data have largely been inter- 
preted as supporting a holocomplex model of LCR 
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Fig. 2. Two models of LCR activity. A) The holocomplex 
model suggests that factors (red circles) bound to the HS 
interact with each other by DNA looping (arrows) to form a 
single LCR holocomplex that loops to activate expression 
from the appropriate globin promoter. B) The linking mode] 
suggests that the LCR serves to ensure that factors (coloured 
circles) are bound Jhroughout the locus. Nucleosomes in 'open' 
chromatin are represented as dispersed green circles, 'closed' 
chromatin as condensed solenoids of green circles. 

action at ectopic sites (34). Once open chromatin 
has been established, it is proposed that each of the 
HS then interacts with each other by DNA looping 
mediated via the bound factors to form an LCR 
holocomplex (Fig. 2A). The holocomplex would 
then interact with a single globin gene in the clus- 
ter, and svritching during development would be 
accomplished by stage-specific silencer elements as- 
sociated with the £- and y-globin genes. Although 
there is no direct physical evidence for DNA loop- 
ing between the LCR and the globin promoters in 
vivo, the holocomplex model is supported by find- 
ings that only a single globin gene is transcribed at 
a time in transgenic mice containing the whole 
human LCRP-globin cluster (35), and that the 
LCR preferentially activates genes closest to it 
(36). Deletions that remove only a single HS 'core 5 
element drastically reduce transgene expression in 
comparison to those that delete an entire HS frag- 
ment (37, 38), indicating that removal of a core 
creates a defective holocomplex (39). 

Quite different conclusions have been arrived at 
using knockout technology on the mouse (J-glohin 
locus (40). Deletions of individual or all the HS in 
the endogenous locus do not alter chromatin struc- 
ture and have relatively minor effects on expres- 



sion of the globin genes (41, 42). These data 
suggest that the LCR is not required for chromatin 
opening at the endogenous mouse P-globin locus, 
and suggest that more distant elements control 
chromatin structure (43). A linking model that 
does not invoke DNA looping has been proposed 
to explain the knockout results (Fig. 2B) (44). In 
this model, the function of the LCR is to enhance 
P-globin expression by ensuring that factors are 
bound at intervals across the cluster and that the 
gene is localized to the right nuclear compartment 
(32). The hnking model is not consistent with the 
ability of the LCR to open chromatin at ectopic 
transgene sites, but the holocomplex model cannot 
easily explain the effect of LCR deletions in the 
mouse P-globin locus. As described below, the two 
models are not necessarily mutually exclusive and 
may be strengffierie^^^rjjr^eiiig "merged. For the 
purpose of gene therapy where globin expression 
cassettes delivered by viral vectors must express at 
ectopic sites, it will be important to design the 
cassettes based on transgene constructs that ex- 
press to high levels at single copy integration sites. 

LCRp-globin expression cassettes for gene therapy 

To express therapeutic levels of P-globin from gene 
therapy cassettes, full expression levels should be 
obtained from a single copy integration in order to 
convert a null thalassemia into an asymptomatic 
carrier state. Initial P-globin gene therapy cassettes 
were designed to be as small as possible to facili- 
tate gene transfer and used either cDNA or ge- 
nomic P-globin fragments controlled by minimal 
promoters (45-47). Addition of small HS core 
elements improves expression in tissue culture ex- 
periments, but were largely disappointing when 
transferred into mouse bone marrow cells (48, 49). 
Recent single copy transgenic mouse experiments 
now demonstrate that full expression by the LCR 
requires all four HS and specific elements within 
the |}-globin gene including the — 1555 bp pro- 
moter and the 3' enhancer (Fig. 3 top) (50). As this 
8.8-kb cassette is too large for conventional retro- 
virus vectors, smaller constructs that express highly 
are required for retrovirus delivery. A very promis- 
ing new 3.9-kb cassette expresses Y-globin rnRNA 
to 70% levels in single copy transgenic mice, using 
P-globin promoter, intron 2 and 3' enhancer ele- 
ments (Fig. 3 centre) (51). This construct, or others 
shown to function effectively in transgenic mice, 
may ultimately prove to be best suited for gene 
therapy and take advantage of the anti-sickling 
properties of y-globin. In addition, these data 
demonstrate that the LCR must functionally inter- 
..act with more than just the promoter. The simplest^ 
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interpretation merges the holocomplex and linking 

models by suggesting that the LCR loops to inter- MoMLV 

act with the promoter, but factors must also be 

bound throughout the gene. SiN 



6* LTR 



3'LTK 




Retrovirus and lentivirus vectors 

Retrovirus vectors have been the method of choice 
for delivering P-globin genes into hematopoietic 
stem cells because they stably integrate at single 
copy into the genome (52). In practice, it has been 
very difficult to obtain high titer P-gJobin retro- 
virus vectors due to instability of the LCR ele- 
ments and globin intron 2 sequences (48, 49). To 
stabilize transmission of intact ^-globin genes it is 
necessary to use only certain combinations of HS 
sites and to delete an AT-rich region in intron 2. 
Although these modifications permit generation of 
high titer retrovirus, it is now apparent that the 
deleted AT-rich sequence is required for high level 
expression (51). 

A limitation of retrovirus vectors is that they 
integrate only into cycling cells, and the target 
HSC for P-globin gene therapy tend to be non-cy- 
cling. In this regard, lentivirus vectors based on 
HIV-1 (53) are far superior as they integrate into 
non-cycling cells and contain the RRE element 
that is bound by the Rev protein to stabilize the 
virus genomic RNA (52). One exciting report of a 
P-globin lentivirus vector demonstrates that it is 
possible to transmit large LCR fragments coupled 
to a P-globin cassette with a small promoter, AT- 
rich deleted intron 2 and no 3' enhancer (Fig. 3 
bottom). Therapeutic levels of p-globin mRNA 
and protein were shown in transduced bone mar- 
row in a mouse model of P-thalassemia (54). 
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Fig. 3. LCRP-globin expression cassettes for gene therapy. 
Full expression in single copy transgenic mice requires an 8.8 
kb construct (top) including all four HS, the large 0-globin 
promoter, p-globin intron 2 AT-rich region, and the intron 2 
and 3' enhancers (yellow triangles). High level expression of 
y-globin exons (Blue boxes) is obtained from a smaller 3.9 kb 
P/y-globm hybrid cassette (centre). As both of these cassettes 
are not stable in retrovirus vectors, cassettes with improved 
stability (bottom) have deletions of deleterious sequences. 





Insulated expression y 

Fig. 4. Retrovirus silencing in stem cells. Silencer elements in 
MoMLV vectors are primarily located in the LTRs (top), and 
silencing can be ameliorated but not eliminated by SIN muta- 
tions in the 3XTR. Insulator elements (INS) in the LTR- 
(centre) block silencing by position effects (red arrows with X) 
in MEL cells but not retrovirus silencing in stem cells (large 
red arrow). The ideal vector (bottom) uses a SIN lentivirus 
vector, insulator elements n^nking a small HS3P/Y-globin ex- 
pression cassette to prevent silencing (green arrow), and no 
marker gene. 

Silencing of retrovirus vectors in stem cells 

Once the P-globin expression cassette has been 
transduced into an HSC, it must express to appro- 
priate levels in erythroid cells produced from the 
HSC. One concern that has severely limited clinical 
gene therapy is that retrovirus-transduced genes 
are often silenced in HSC (52, 55-57). It has long 
been known that retroviruses are silenced in em- 
bryonic stem cells and in transgenic mice (58, 59). 
Moreover, it has been shown that retrovirus and 
lentivirus sequences silence LCR(}-globin transge- 
nes in mice (60, 61), and that the transgenes fail to 
-open chromatin (60). These data- demonstrate- that- 
the vectors have silencer elements, and suggest that 
they are recognized by specific factors that are 
restricted to stem cells in mammals. 

There are several possible means to overcome 
vector silencing in stem cells. First, the silencer 
elements can be defined in the retrovirus or 
lentivirus vectors and, if not required for virus 
replication, can be removed. As the silencers tend 
to be located in the viral long terminal repeats 
(LTRs) that control virus transcription (Fig. 4 
top), mutations produce self-inactivating (SIN) 
virus that is inherently safer. Several SIN retro- 
virus vectors have been developed from Moloney 
murine leukemia virus that express to higher levels 
in ES cells and transgenic mice (62-65). Neverthe- 
less, none of the SIN vectors express at all integra- 
tion sites in stem cells. To overcome this residual 
silencing in the "SIN vectors, it may be possible to 



20 



Beta-globin locus control region versus gene therapy vectors 



incorporate insulator elements, to block the spread 
of silencing from surrounding retrovirus sequences 
(66). The chicken P-globin cHS4 element is an 
insulator (66, 67) and has been placed in the LTR 
of retrovirus vectors to block position effects in 
murine erythroleukemia (MEL) cells (Fig. 4 centre) 
(68, 69). However, an insulator in the LTR does 
not shield internal genes from silencing established 
on the retrovirus sequences in ES cells (68). A 
better construct design would position insulators 
on both sides of the internal gene rather than in 
the LTR (Fig. 4 centre). 

Mechanism of retrovirus vector silencing 

An understanding of the mechanism of retrovirus 
silencingmay pennit_interventions that prevent.its._. 
establishment or maintenance (70, 71). Many 
groups have correlated de novo cytosine methyla- 
tion (72) of CpG dinucleotides in retroviruses with 
the silenced state (58, 59, 68, 73, 74), and expres- 
sion can be reactivated to a low level using the 
methylation inhibitor 5 AzaCytidine (5AzaC) (75). 
However, this indirect evidence does not address 
whether methylation is a cause of silencing or a 
consequence. Recent direct evidence that retrovirus 
silencing isindependent of d% novo methylase func- 
tion has been obtained using dnmt3 knockout ES 
cells and transgenic Drosophila that have no 
methylase activity (60). As chromatin of retrovirus- 
silenced LCRP-globin transgenes in mice is inac- 
cessible to DNasel and marked by deacetylated 
histone H3 and bound linker HI (60), it appears 
that chromatin modifications play a role in retro- 
virus silencing: Attempts to relieve silencing may, 
therefore, require inhibitors of histone deacetylases 
(HDAC) or of HI binding. 

Silencing is often established by one pathway 
and. maintained by another. Time course experi- 
ments in ES cells demonstrate that most retro- 
viruses are silenced " within 2 days but some 
integration sites escape complete silencing and ex- 
press to low levels (60, 76, 77). As methylation is 
not detectable by 2 days post-infection, methyla- 
tion is likely to be a consequence of, or secondary 
step in, retrovirus silencing. Most experiments 
have focussed on the subset of infected cells that 
initially express. Over time, these are gradually 
silenced in a process known as extinction. In in- 
fected MEL cells, extinction can be reversed early 
in the process using the HDAC inhibitor Tricho- 
statin A (TSA) (78). However, the methylation 
inhibitor 5 AzaC is required in addition to TSA to 
overcome extinction at later time points. These 
data suggest that methylation is an important sec- 
ondary or associated step in extinction of virus--* 



expression in mature cell types. The ability of TSA 
or 5 AzaC to activate expression in the majority of 
transduced cells that are completely silenced from 
the outset has not been rigourously tested to date. 
In summary, efforts to prevent retrovirus silencing 
using methylation and HDAC inhibitors hold 
promise, but require more knowledge of the mech- 
anism and demonstration of their utility in silenced 
stem cell populations prior to extinction. 

An ideal LCRp-globin lentivirus vector 

The ideal p-globin gene therapy vector should sta- 
bly integrate into an HSC at high efficiency and be 
expressed to near endogenous levels at single copy. 
To accomplish this goal, lentivirus vectors have a 
clear adv antage in their a bility to infect non-cy- 
cling stem cells and stably transmit large LCRp- 
globin expression cassettes. The best existing 
LCRP-globin lentivirus vector has these features 
(54), with the additional advantage , of omitting a 
selectable marker gene. Although marker genes are 
convenient for determining transduction frequen- 
cies, most are derived from non-mammalian 
sources and may themselves be subject to gene 
silencing effects. However, the vector can be opti- 
mized further to direct expression at all integration 
sites and for vector safety (Fig. 4 bottom). First, a 
SIN version of the lentiviral vectors must now be 
used with third generation packaging systems de- 
signed to prevent recombination events that gener- 
ate replication-competent HTV-1 virus (79). The 
SIN lentivirus vector will not only prevent rescue 
and spread of the vector by any helper virus, but 
may also improve expression of the LCRP-globin 
cassette (60). Second, the LCRP-globin cassette 
should be flanked by insulator elements that are 
known to block silencing. In practise, this may 
require a different insulator on one side than the 
other to avoid recombination events that delete the 
LCRP-globin cassette. Finally, the LCRP-globin 
cassette should express highly at single copy in 
transgenic mice. Such constructs use HS3 coupled 
to a large p-globin promoter, the P-globin intron 2 
including both the AT-rich region and enhancer 
and the 3' enhancer (51). Use of hybrid genes 
permits expression of anti-sickling y- or 8-globin 
coding sequences instead of p-globin exons (51, 
80). A combination of these components should 
create a lentivirus vector that is safe and expresses 
therapeutic levels of globin. 

Preclinical models of p-thalassemia 

Promising p-globin gene therapy vectors have been 
tested primarily in MEL cells, or in infected mouse 
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bone marrow. Expression in erythroid cells derived 
from primitive, progenitor cells can be assayed us- 
ing CFU-S assays in which spleen colonies are 
formed after 12 days in vivo. Expression in cells 
derived from an infected HSC must be assayed 
after long-term repopulation assays, followed by 
secondary transplantation. To show therapeutic 
efficacy, these long-term studies should be per- 
formed first on mouse models of (3-thalassemia and 
sickle cell anemia. A variety of these models have 
been created by gene targeting and transgenic tech- 
nology (81-83). These mouse models can be cor- 
rected by expression of y-globin transgenes, and it 
has been shown that an LCR^-globin lentivirus 
vector can express f$-globin mRNA and protein in 
these models with therapeutic benefit (54). 

Despite success with the mouse models, it has 
proved much more difficult to transduce human 
HSC than murine HSC (84). Fortunately, another 
preclinical model of P-globin gene therapy into 
human stem cells is available (Fig. 5). The NOD- 
Scid mouse is severely immunocompromized and 
fails to reject human bone marrow, transplants 
(85). Transplanted human HSC home to the mouse 
bone marrow where they are supported by the 
hematopoietic microenvironment. Human bone 
marrow cells from (3-thalassemia and sickle cell 
patients have been shown to repopulate the bone 
marrow of these mice and generate human red cells 
that mimic the disease. This system is very well 
suited to test expression from LCRP-globin 
lentivirus vectors in human stem cells. To this end, 
patient bone marrow or sorted HSC would be 
infected with the lentivector in vitro prior to in vivo 




Fig. 5. The NOD-Scid preclinical mouse model of human stem 
cell gene therapy for p-thalassemia or sickle cell anemia. Pa- 
tient bone marrow stem cells are infected with the p-globin 
lentivirus vector in vitro, prior to in vivo engraftment into 
immunocompromised NOD-Scid mice. Gene transfer into en- 
grafted human stem cells, p-globin transgene expression, and 
phenotypic improvement can then be monitored without ex- 
posing patients to experimental lentivirus vectors. 



assay in the NOD/Scid mice. Hence, long-term 
expression in human stem cells can be assayed in 
vivo without exposing patients to experimental 
lentivirus vectors. 

Future prospects 

The first retrovirus vectors for P-globin gene ther- 
apy were designed over 15 years ago* and through 
slow careful research many obstacles were discov- 
ered and gradually surmounted. The success of a 
well-designed LCR P-globin lentivirus vector in 
correcting a mouse P-thalassemia model is a mile- 
stone in this process that can now be completed 
through incremental improvements to vector ex- 
pression and safety. Ultimately, validation of p- 
globin gene therapy in human stem cells using the 
NOD-Scid preclinical model will justify clinical 
trials of this exciting potential cure for 
hemoglobinopathies. 
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ABSTRACT We describe studies of gene transfer and 
expression of the human glncocerebrosidase cDNA by a Molo- 
ney murine leukemia virus (MoMuLV>based retroviral vector 
in a marine gene transfer/bone marrow transplant (BMT) 
model. Pfuripotent hematopoietic stem ceUs (HSCs) were as- 
sayed as the colony-forming units, spleen (CFTJ-S) generated 
after serial transplantation. Transcriptional expression from 
the MoMuLV long-terminal repeat (LIU) was detected at a 
high level in the primary (1°) CFU-S and tissues of reconstituted 
BMT recipients. However, we observed transcriptional inac- 
tivity of the pro viral MoMuLV-LTR in >90% of the secondary 
(V) CFU-S and in 100% of the tertiary (3°) CFU-S examined. 
We have compared the methylation status of the pro virus in the 
1° CFU-S, which show strong vector expression, to that of the 
tnnscripoonaDy inactive provirns in the V and 3* CFU-S by 
Southern blot analysis using the memylatioo -sensitive restric- 
tion enzyme Sma L The studies demonstrated a 3- to 4-fold 
increase in methylation of the Sma I site in the proviral LTR of 
2° and 5° CFU-S compared to the transcriptionally active 1° 
CFU-S. These observations may have important implications 
for future chnkal applications of retroviral-mediated gene 

needed for an enduring therapeutic effect. 



Gene therapy via bone marrow cells is a promising technique 
for treatment of a wide variety of human diseases, including 
genetic disorders, cancer, and AIDS. Effective long-term 
bone marrow gene therapy requires the fulfillment of two 
main criteria. The exogenous gene should be introduced into 
a high percentage of long-lived pluripotent hematopoietic 
stem cells (HSCs). Subsequently, the introduced gene should 
be persistently expressed in the mature hematopoietic prog- 
eny cells of the stem cell, thereby maintaining the effects of 
gene therapy for the lifetime of the individual. Although 
Moloney murine leukemia virus (MoMuLV>based retroviral 
vectors are currently the most efficient vehicles for gene 
transfer into a variety of cell types including HSCs (reviewed 
in ref. 1), the long-term in vivo expression from the viral 
promoter/enhancer elements has been unsatisfactory. Lack 
of gene expression from the 5 ' MoMuL V long-tenrrinai repeat 
(LTR) has been observed in several systems including pri- 
mary fibroblasts (2) and hematopoietic cells (3, 4). Previous 
studies by our laboratory, using a retroviral vector in which 
a normal human glucocerebrosidase (GO cDNA is controlled 
by the enhancer/promoter of the 5' MoMuLV-LTR, dem- 
onstrated a high rate of lack of expression in cells derived 
from HSCs (5). 

Methylation of cytosine residues has been shown to be 
associated with suppression of gene expression and, in cer- 
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tain circumstances, with the silencing of viral control ele- 
ments (6). The MoMuLV-LTR is completely inactive in 
embryonic stem and embryonic carcinoma cell lines, and the 
inactivity is accompanied by de novo methylation of the 
proviral sequences (7, 8). Moreover, methylation has been 
detected in association with the MoMuLV-LTR transcrip- 
tional inactivity in fibroblasts in vitro (9) and in vivo (2). 

In this study, we investigated long-term in vivo expression 
from the MoMuLV-LTR by transduction of murine bone 
marrow cells with a MoMuLV-based retroviral vector and 
serial bone marrow transplantation (BMT) into lethally irra- 
diated recipient mice. We document a high rate of expression 
failure associated with methylation of the vector LTR in the 
secondary (2°) and tertiary (3°) colony-forming units, spleen 
(CFU-S). 

MATERIALS AND METHODS 

Retroviral Vector. The G2 retroviral vector and its corre- 
sponding high-titer amphotropic PA317 packaging cell clone 
have been described (5). G2 consists of the LTR from the N2 
vector flanking the human GC cDNA. The packaging cell line 
clone used in the experiments was negative for helper virus 
production assayed by testing for transfer of the amphotropic 
env gene into 3T3 fibroblasts through PGR analysis (10). 

Transduction of Murine Bone Marrow Cells. Donor bone 
marrow cells were harvested from male C57BL/6J mice 
(Charles River Breeding Laboratories), prestirnulated in the 
presence of growth factors, and cocultivated over vector 
producing fibroblasts according to the methods described by 
Weinthal et ai (5). The growth factors used for the pre stim- 
ulation were 200 units of murine interieukin 3 (IL-3) per ml 
(Biosource, Camarillo, CA), 100 units of human IL-6 per ml 
(Amgen), 200 units of human IL-la per ml (Immunex) and 50 
ng of mast cell growth factor per ml (or c-kit ligand; Im- 
munex). 

BMT and Sample Collection. Recipient female C57BL/67 
mice (8-12 weeks old) were irradiated with two split doses of 
600 and 450 cGy 24 hr apart. Transduced bone marrow cells 
were injected into the tail vein of the irradiated mice at 1 x 
10 6 cells per mouse for isolation of CFU-S or 2-4 x 10* cells 
for long-term reconstitution. Twelve days after BMT, two to 
four mice transplanted with 1 x 10* bone marrow cells were 
sacrificed. Well-defined, individual primary (1°) CFU-S were 
isolated and divided evenly into two portions, one for DNA 



Abbreviations; MoMuLV, Moloney murine leukemia virus; LTR, 
long terminal repeat; HSC, hematopoietic stem cell; GC, glucoce- 
rebrosidase; CFU-S, colony forming unit, spleen; BMT, bone mar- 
row transplantation; I s , primary; 2°, secondary; 3*, tertiary. 
*To whom reprint requests should be addressed at; Division of 
Research Immunology/Bone Marrow Transplantation, Chfldrens 
Hospital Los Angeles, 4650 Sunset Boulevard, Los Angeles, CA 
90027. 
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•This set of 2* CFU-S was analyzed for RNA expression by reverse transcription/PCR. 



and one for RNA analysis. Animals transplanted with 2-4 x 
100 bone marrow cells were sacrificed after 1-3 months. 
Hematopoietic tissues were collected for nucleic acid anal- 
ysis, and bone marrow cells were used directly to reconsti- 
tute a second generation of lethally irradiated female mice. 
Twelve days after the secondary BMT, the 2° CFU-S were 
isolated for DNA and RNA analysis. In one experiment, 3* 
BMT was performed from bone marrow of long-term- 
reconstituted 2° recipient animals in order to generate 3° 
CFU-S. 

DNA and RNA Analysis* Genomic DNA was isolated by 
SDS/proteinase K and RNase digestion at 55°C for 3-4 far. 
The digested tissues were extracted with phenol/chloroform; 
the DNA was precipitated in ethanol and resuspended in TE 
buffer. The presence of pro viral GC sequences in the CFU-S 
and hematopoietic tissue samples was assayed by PGR using 
the human GC-specific oligonucleotide primers described by 
Weinthal et at (5), followed by Southern blotting and hy- 
bridization with a 32 P-end-labeIed internal oligonucleotide 
(8). Pro virus DNA was also detected by Southern blot 
analysis after digestion of genomic DNA with the Sst II and 
Xho I restriction enzymes (BRL). These digestions release 
the 1.65-kb GC cDNA detected by hybridization with the 
1.5-kb (Sst U/BamBI) human GC cDNA probe. The probe 
was labeled with p^dCTP by the randonvprhning method. 
Individual pro virus integrants in the CFU-S and long-term 
hematopoietic tissues were detected by Southern blot anal- 
ysis of genomic DNA digested with BamHI, whjch cuts at 
one site in the provirus. Again, the Southern blot was 
hybridized with the 1.5-kb ^-labeled human GC cDNA 
probe. 

RNA was isolated from the tissues by the acid gnanjdmium 
tmocyanatc/phenol/chloroform method (11). RNA (15 /ig) 
was electrophoresed on a 1.2% formaldehyde gel, denatured, 
neutralized, and transferred to a nylon membrane by capil- 
lary blotting. The filter was hybridized with the human GC 
cDNA probe. After a satisfactory exposure was obtained, the 
filter was stripped and rehybridized with the mouse 0-actin 
DNA probe. For reverse transcription/PCR, 1 /ig of RNA 
was reverse transcribed using the human GC-specific oligo- 
nucleotide primers, followed by PCR amplification of the 
cDNA as described above for the DNA samples. 

Mctiryfatioa Analysis. The methylation status of the provi- 
ral 5' LTR in the CFU-S was determined by digestion of 
genomic DNA (15-25 jig) with flamffl to reduce the size of 
the DNA fragments, followed by Pvu II digestion. The DNA 
was then precipitated with ethanol, redissolved in TE buffer, 



and divided into two equal portions, one of which was 
subjected to digestion with the methylation-sensrtive enzyme 
Sma I. Completeness of the genomic DNA digestions was 
monitored by mixing a sample of the digestion mixture with 
either adenovirus type 2 DNA or A DNA (BRL), which were 
subsequently run on a \% gel. Pvu II and Pvu HI Sma 
I-digested DNA were electrophoresed and blotted to nylon 
membranes. The blots were probed with a ^-labeled frag- 
ment of the G2 vector from the Spe I site in the untranslated 
leader region to a Pvu II site near the 5' end of the GC gene 
(sec Fig. 3). Densitometric analyses were performed with the 
United States Biochemical SciScan 5000, measuring the 
relative densities of the 1.8-kb Sma I-resistant band and the 
1.5-kb Sma I-sensrtive band in each lane. 

RESULTS 

Expression of G2 in Vho in Marine HSCs. Results of 
G2-mediated gene transfer and expression in, the mouse 



PA317 




12 13 




2 3 4 5 6 



T 8 no 11 



Fig. 1. Representative Northern blot analysis of the CFU-S 
generated in the mouse model of gene transfer/BMT (experiment 1 
of Table 1). RNA from 1° CFU-S (lanes 1-6) and 2* CFU-S generated 
1 J months after primary BMT (lanes 7-11). Bone marrow cells used 
for BMT were transduced with G2 (lanes 1-4 and 7-11) or with the 
neomyem-containmg control retroviral vector (lanes 5 and 6). RN As 
from the fibroblast cell line PA317 (lane 12) and PA317 transduced 
with G2 (lane 13) were used as negative and positive controls, 
respectively, for GC mRNA. (Upper) Northern Mot was probed with 
the human GC cDNA identifying the two proviraJ transcripts, the 
full-length 4.0 kb, and the spliced 3.5 kb. {Lower) Same blot was 
stripped and rehybridized with the mouse 0-actin probe except for 
the control samples (lanes 12 and 13). 
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model of gene transfer/BMT are shown in Table 1. In the 1° 
CFU-S, a gene transfer extent of 100% was recorded in the 
three experiments performed. Furthermore, RNA expres- 
sion was detected at a high level in all nineteen 1° CFU-S 
examined. An example of RNA expression from the Mo- 
MuLV-proviral LTR is shown in Fig. 1. The two proviral 
RNA transcripts in the 1° CFU-S are depicted in lanes 1-4. 
These data confirm previous findings from our laboratory, 
which demonstrated high-level expression by G2 in 1° CFU-S 
(5). 

To confirm hematopoietic reconstitution with genetically 
marked cells, hematopoietic tissues were analyzed from mice 
1-3 months after primary BMT. The intact human GC cDNA 
was detected by genomic DNA Southern blot analysis in all 
10 spleen and marrow samples and in the thymus from two of 
four primary recipient animals. Vector RNA transcripts were 
detected, by Northern blot analysis, in all tissues in which the 
provirus DNA sequences were present (Table 1). 

Analysis of the 2° CFU-S showed a high overall retroviral 
gene transfer frequency, with 70 of 101 (70%) containing the 
G2 provirus. However, RNA expression was rarely detected 
in these tissues. In a total of three experiments, RNA 
expression from the MoMuL V-LTR was detected in only 4 of 
55 (7.3%) of the 2° CFU-S that contained the G2 provirus. The 
Northern blot in Fig. 1 shows an example of five 2° CFU-S 
that did not express proviral RNA transcripts (lanes 7-11). 
Experiment 2 was carried out further by allowing two other 
secondary BMT recipients to survive for 3 months after the 
transplantation, producing tissues stably engrafted by the 
serially passaged marrow (Table 1). The spleen, thymus, and 
marrow of the two reconstituted 2° recipients contained G2 
provirus DNA by Southern blot. Vector RN As were detected 
at a low level in the spleen and marrow, but not the thymus, 
of one of the two animals. No vector transcripts were 
detected in the other reconstituted 2° recipient. Tertiary BMT 
was also performed in experiment 2 with the bone marrow 
obtained from the reconstituted 2° recipients 3 months after 
secondary BMT. Seven of the twenty-nine 3° CFU-S ana- 
lyzed contained G2 provirus DNA; these were all derived 
from the donor 2° animal, which had shown expression of 
vector RNA in its tissues. None of the seven 3° CFU-S 
containing proviral DNA had detectable vector RNA expres- 
sion. 

Gene Transfer into Pturipotent HSC& Provirus integration 
patterns were followed in 1°, 2°, and 3° recipient animals to 
demonstrate that the 2° and 3° CFU-S, in which the LTR was 
inactive, are descended from true pturipotent HSCs (Fig. 2). 
Both the spleen and marrow of the 1° recipient, 3 months after 
BMT (Table 1, experiment 2), showed the same pattern, with 



Fig. 2. Integration patterns of the G2 retroviral 
vector by Southern blot analysis. DNA was isolated 
from the tissues (experiment 2) and subjected to 
BamUl digestion, which cuts at one site in the 
retroviral vector. 1* tissues refer to the spleen (lane 
3) and bone marrow (lane 4) of one animal sacrificed 
3 months after primary BMT. The marrow of this 
animal was used to reconstitute 2° recipients to 
generate the 2° CFU-S depicted in lanes 5-9. One 
reconstituted 2° recipient was sacrificed 3 months 
after T BMT. The spleen, thymus, and bone mar' 
row (2° tissues) were analyzed (lanes 10-12). The 
marrow from tins 2° BMT recipient was used in a 3° 
transplant to generate the 3° CFU-S (lanes 13-15). 
The extent of methylation measured for each 
CFU-S is indicated as the percentage of Sma I 
resistance. Vector expression was measured by 
Northern blot analysis and is indicated as present 
(+) or absent (-). 

five proviral integrants (Fig. 2, lanes 3 and 4). The five 2° 
CFU-S that were produced from marrow of that primary 
recipient showed segregation of the vector integrants seen in 
the primary tissues into two different patterns. The first 
pattern, represented in lanes 6, 7, and 9, consisted of four 
proviruses (although there may be slight contamination of the 
sample in lane 6 with DNA from a CFU-S with the integrant 
seen in lane 5). Interestingly, despite the same pattern of 
integrants among these three 2° CFU-S, there was strong 
vector expression seen in the one in lane 6, whereas the two 
foci seen in lanes 7 and 9 had no detectable vector transcripts. 
The second pattern of vector integrants seen in two other 2° 
CFU-S (lanes 5 and 8) had one provirus. The latter single 
integrant of =4 kb was also detected in the spleen, thymus, 
and marrow of another reconstituted secondary animal 3 
months after secondary BMT (lanes 10-12). Moreover, this 
integrant was transferred to three CFU-S of a 3° recipient 
animal. Thus, the same transduced stem cell that generated 
the 2° CFU-S was able to reconstitute all three hematopoietic 
tissues (spleen, thymus, and marrow) of a secondary animal 
sacrificed 3 months after BMT and also to generate 3° CFU-S 
in the 3° transplanted animal. The provirus present in the 2° 
and 3° CFU-S was transcriptionally inactive (Table 1). 

Methylation Analysis of the CFU-S. Genomic DNA from 
transcriptionally active 1° CFU-S and from inactive 2° and 3° 
CFU-S were compared for Sma I resistance according to the 
protocol described in Materials and Methods and illustrated 
in Fig. 3. Sma I is a methylation-sensitive enzyme that can 
cleave DNA at the CCCGGG site only if the CpG sequence 
is not methylated; therefore, Sma I resistance is used as a 
measurement of DNA methylation. Two examples of the 
resulting Southern blots are shown in Fig. 4. In the packaging 
cell line PA317, the G2 provirus gives a 1.8-kb Pvu II band 
(Fig. 4, lanes 1), which is reduced to a 1.5-kb band upon Sma 
I digestion (lanes 2). Complete digestion by Sma I shows the 
absence of methylation of the 5' LTR in the fibroblast cell 
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LSkb 

P P 

l.ftb 

S P 

EZZ ZZ Xyr ZZ 3 Probe 

Fig. 3. Map of the G2 retroviral vector showing restriction sites 
and probe used for DNA methylation analysis. 
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PA317/G2 H 



1*CFU-S 



2*CFU~S 




Fig. 4. Southern blot analysis of methyl- 
ation of the G2 pro virus in the 1° CFU-S (a) and 
T CFU-S (a and b). DNA was subjected to 
Bam HI and Pvu II with or without Sma I 
digestions. Lanes 3 and 4, labeled (-), represent 
DNA from normal control C57BL/6J mice. A 
second smaller band is seen in the PA317 cells 
and in the 1° CFU-S analyzed here due to the 
presence of a pro virus with a short deletion in 
the 5' untranslated region among the six copies 
in the packaging cell clone. 



line. All Sma I digestions of DMA from 1° CFU-S show 
reduction in size of the vector-specific band to 1.5 kb, 
corresponding to lack of methylation of the Sma I she of the 
pro virus 5' LTR in 1° CFU-S. In contrast, the provirus in all 
2° CFU-S shows some extent of Sma I resistance reflecting 
various degrees of methylation of the 5' MoMuLV-LTR in 
these tissues. 

To quantitate the degree of methylation at the Sma I site in 
the CFU-S, densitometric analyses were performed on the 
Southern blots, comparing the relative intensities of the 1.8- 
and 1.5-kb bands. Fig. 5 displays the percentage Sma I 
resistance of the G2 provirus in 3T3 fibroblasts, 1° CFU-S, 
and 2° and 3° CFU-S. In experiment 1 (Fig. 5A), the 1° CFU-S 
showed a mean of 8.9% Sma I resistance. In contrast, the 
average percentage Sma I resistance recorded in the 2° 
CFU-S generated 1.5 months after initial gene transfer/BMT 
was 24.6%, ranging from 9.0% to 47.6% over 10 samples 
analyzed. Thus, a Mold increase in percentage Sma I resis- 
tance was recorded between the 1° and 2° CFU-S of this 
experiment The difference in methylation is statistically 
significant, with P < 0.005 by Student's / test analysis, rig. 
SB represents the comparison between 1° and 2° CFU-S 
derived 3 months after initial gene transfer/BMT in experi- 
ment 2. The percentage Sma I resistance was 14.9% in the 1° 
CFU-S and increased 3.8-fold to 56.8% in the 2° CFU-S. In 
the same experiment, the 3° CFU-S generated 3 months after 
secondary BMT showed 74.7% Sma I resistance, represent- 
ing a 3.9-fold increase over the 19.3% Sma I resistance seen 
in other 1° CFU-S from the same experiment (Fig. 5C). 
Methylation of the provirus in both 2° and 3° CFU-S of this 
experiment was significantly greater than in 1° CFU-S (P < 
0.001 by Student's t test). We conclude that the transcrip- 
tional inactivity observed in the 2° and 3° CFU-S is associated 
with methylation of the 5' MoMuLV-provirus LTR at the 
Sma I site, 30 bp downstream of the transcription start site. 
Of note, the methylation status of the three 2° CFU-S with the 
common pattern of four vector integrants (Fig. 2, lanes 6, 7, 
and 9) was similar, ranging from 44% to 74%, despite the 
discordance for expression. 

DISCUSSION 

We have studied gene transfer of a MoMuLV-based retro- 
viral vector into murine hematopoietic stem cells and ex- 
pression from the MoMuLV-LTR promoter/enhancer ele- 
ments in the progeny of the transduced cells. The expression 
from the pro viral LTR was measured in the CFU-S derived 
from primary BMT performed after gene transfer, in the 



hematopoietic tissues of long-term reconstituted animals, 
and in the CFU-S generated, after serial transplantation. 

Our results demonstrate that the MoMuLV-LTR is a very 
efficient expression unit in 1° CFU-S. We have also detected 
expression from the MoMuLV-LTR in the hematopoietic 
tissues of transplant recipients 3 months after primary BMT. 
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Fio. 5. Densitometric analysis to quantitate the extent of meth- 
ylation of the MoMuLV-LTR in the r, 2°, and 3° CFU-S. The 
densities of the 1.8- and 1.5-kb bands after Sma I digestion were 
measured and percentage Sma I resistance was calculated. Each 
number represents the mean of three different readings. (A) Percent- 
age Sma I resistance of samples from experiment 1. (B and C) Results 
from experiment 2 of the 2° CFU-S generated 3 months after 
secondary BMT and 3° CFU-S generated 3 months after 3* BMT, 
respectively. In each graph, the 1* CFU-S and either the T or the 3° 
CFU-S were analyzed from the same blot. 
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These results arc comparable with previously published 
observations (4, 5, 12, 13). However, 1° CFU-S are mainly 
derived from committed progenitor cells restricted to the 
myeloid lineage (14, 15). Even the hematopoietic cells in the 
organs 3 months after BMT may be derived from progenitor 
cells capable of short-term but not long-term engraftment 
(16). By following the proviral integration pattern alter serial 
transplantation, we were able to show that the cells capable 
of forming 2° CFU-S have the characteristics of long-lived, 
pluripotent HSCs. In this stringent assay of HSCs, we have 
observed that the MoMuLV-LTR is frequently inactive in the 
resultant progeny cells derived from the HSCs. We cannot, 
however, determine whether the absence of expression in the 
2° CFU-S reflects the loss of expression by the LTR with time 
or whether the LTR is never active if inserted into the 
subclass of long-lived HSCs capable of producing 2° and 3° 
CFU-S. 

The failure of transcription from the MoMuLV-LTR in 
hematopoietic tissues is in accord with prior observations 
(3-5, 17). However, the previous studies have mostly ana- 
lyzed expression in reconstituted hematopoietic tissues of 
primary recipients. Moore et al. (17) have studied expression 
of human adenosine deaminase (ADA) by the MoMuLV- 
LTR vector expression through 2° CFU-S. Despite strong 
ADA expression in the primary recipients, they detected 
expression in only eighteen of seventy-two 2° CFU-S, al- 
though the percentage of these colonies that contained pro- 
viral DNA was not determined. 

Methylation is associated with transcriptional inactivation 
of many genes and has specifically been seen in association 
with inactivity of the transduced MoMuLV-LTR in embry- 
onic stem and embryonic carcinoma cell lines (7). Therefore, 
we examined the methylation status of the inactive 5' Mo- 
MuLV-LTR in the 2° and 3° CFU-S. The presence of a high 
copy number of endogenous murine retroviral sequences 
creates a high background in the analysis of methylation 
across the provirus 5' LTR. To overcome this problem, we 
used the restriction enzymes Sma I and Pvu II, which 
generate a specific provirus band detected by Southern blot 
analysis. Unfortunately, this assay restricted our analysis of 
methylation to one CpG dinucleotide contained in the Sma I 
site, 30 bp downstream from the transcription start site in the 
5' MoMuLV-proviral LTR. 

Our analysis has indicated striking differences in the meth- 
ylation patterns of this sequence. The Sma I site described is 
extensively methylated in the 2° CFU-S, which do not show 
MoMuLV proviral transcription, but is not methylated in the 
1° CFU-S, which do express vector transcripts. We have 
observed that this same Sma I site is completely methylated 
in vector-transduced embryonic stem cell lines, which also do 
not show expression from the MoMuLV-LTR (data not 
shown). Moreover, studies done by Singer-Sam et al (18) on 
the phosphoglycerate kinase promoter present on the inac- 
tive X chromosome have shown that methylation of a similar 
site, a Hpa II site (CCGG) at position +20, correlates with 
lack of transcription from the promoter. Nevertheless, the 
observed association between proviral methylation and ex- 
pression inactivity does not show whether methylation plays 
a causal role in suppressing expression or is merely a sec- 
ondary event after failure of expression has occurred. 

Interestingly, the association between methylation and 
expression inactivity was not complete ; at least one 2° CFU-S 
with extensive methylation at the Sma I site had a high level 
of vector transcripts. Other 2° CFU-S with the same proviral 



integrants failed to express the vector. This set of samples 
would indicate that neither integration site, methylation at the 
Sma I site, nor differentiation status of the transduced stem 
cell act to absolutely govern expression. The observed dis- 
cordance suggests that commitment to expression or inac- 
tivity may stochastically be made after the HSC has differ- 
entiated to produce multiple pre-CFU-S. 

The studies presented in this paper suggest that the wild- 
type MoMuLV-LTR may not be the ideal transcriptional unit 
for expression in pluripotent HSCs and their progeny cells. 
Successful approaches to overcoming this problem have been 
to use either modified LTRs with enhancer substitutions (19) 
or internal promoters derived from housekeeping genes (such 
as phosphoglycerate kinase or 0-actin) instead of the wild- 
type MoMuLV-LTR (20, 21). Characterization of a transcrip- 
tionally active retroviral vector in HSCs may provide a better 
understanding of the regulation of gene expression occurring 
in these cells. 
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